PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY 



(PCT) 



(51) International Patent Classification * 
A61K 48/00, C12N 5/00, 15/00 



Al 



(It) International Publication Number: WO 97/90698 

(-13) InteroattoiialPubucation Date: 9 January 1997 (09.01.97) 



(21) International Application Number: PCT/US96V 10602 

| (22) International Filing Date: 20 June 1996 (20.06.96) 



(30) Priority Data: 
60017,814 



20 June 1995 (20.06.95) 



US 



(71) Applicant: THE GENERAL HOSPITAL CORPORATION 
rUS/US]; 55 Fruit Street, Boston, MA 021 14 (US). 

(72) Inventors: WANDS, Jack, R^ 210 Varidc Road, Wahan, MA 
1 02168 (US). SCAGUONL Pier, Paolo; 6 Sbeafe Street 

#4v Boston, MA 02113 (USX MELEGARL Margherita: 6 
Sheafe Street #4, Boston, MA 021 13 (USX 

! (74) Agent: CLARK, Paul, T4 Bsh & Richardson P.C, 225 Franklin 
Street, Boston, MA 021 10*2804 (USX 



m DCS rS?^ S ^ CS l AU > ^ fiuro P ean Patent (AT, BE, 
CH, D£ DK, ES. FL FR, GB. GR, IE, IT, LU, MC, NU 



Published 

With international search report. 



(54) Title: INHIBITION OF HEPATTTIS B REPLICATION 
(57) Abstract 

J ^>^^S^ ^vt^. eg., hepatitis B virus (HBVX by 

a fir., amino add sequence that is S^iLS tTf rS^T^^ £522*^ ^ P 01 ^** 

lacks a second amino add seaueace of the wild of a wild type hepadnavma core protein, and dtber 

amino adds of me wi^Tb^uM^ S^S^Tf^ TT^f^f including .he carboiy te m 1 inal^ 

add sameace mat is »ub S S^i^^^ c S^^ < liL£^ a ^ S. 1 * "P™"*""* "»»° acid or an amino 

add of ** surface prSeToffioined by ^^^^^^^^^ P^^o^ an.ino |* 



Best Available Copy 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


AlDKSU 


GB 


United Kingdom 


MW 


Malawi 


AT 


Austria 


GE 


Georgia 


MX 


Mexico 


AU 


Australia 


GN 


Guinea 


NE 


Niger 


BB 


Barbados 


GR 




NL 


Netherlands 


BE 


Belgium 


HTJ 


Hungary 


NO 


Norway 


BF 


Burkina Fas© 


m 


Ireland 


NZ 


New Zealand 


BG 


Bulgaria 


IT 


Italy 


PL 


Poland 


BJ 


Barm 


JP 


Japan 


FT 


Portugal 


BR 


Brazil 


KB 


Kenya 


RO 


Rornania 


BY 


Belarus 


KG 


Kyigystan 


RU 


Russian Federation 


CA 


Canada 


KP 


Democratic People's RepobBc 


SD 


Sudan 


CF 


Central African Republic 




of Korea 


SE 


Sweden 


CG 


Congo 


KR 


Republic of Korea 


SG 


Singapore 


CH 


Switzerland 


KZ 


Kazakhstan 


SI 


Slovenia 


a 


Cote dlvoire 


U 




SK 


Slovakia 


CM 


Cameroon 


LK 


Sri Lanka 


SN 


Senegal 


CN 


China 


LR 


Liberia 


sz 


Swaziland 


CS 




LT 




TD 


Chad 


CZ 


Czech Republic 


LU 


Luxembourg 


TG 


Togo 


DE 




LV 


Latvia 


TJ 


Tajikistan 


DK 


Denmark 


MC 


Monaco 


TT 


Trinidad and Tobago 


EE 


Estonia 


MD 


Republic of Moldova 


UA 


Ukraine 


ES 


Spain 


MG 


Madagascar 


UG 


Uganda 


FX 


Finland 


ML 


Mali 


US 


United States of America 


FR 


France 


MS 


Mongolia 


uz 


Uzbekistan 


GA 


Gabon 


MR 


Mauritania 


VN 


Viet Nam 



17:39:34 



WO 97/00698 

PCT/US9 6/1 0602 

- 1 - 

INHIBITION OF HKPATTTTS ft REPI.TgA^ PTOW 

The invention vas funded in part by grants CA- 
357X1 and AA-08169 from the National Institutes of 
5 Health. The Government has certain rights to this 
invention. 

Background of Tnv^nf io n 

The invention relates to treating infections of a 
hepadna virus, e.g. , hepatitis B virus. 
10 Hepatitis B virus (HBV) is a member of the 

hepadnavirus family, a group of enveloped DMA viruses 
that cause acute and chronic hepatitis. Hajor clinical 
consequences of HBV infection include acute liver 
failure, liver cirrhosis, and primary hepatocellular 
15 carcinoma (HCC) . with more than 250 million individuals 
infected worldwide, effective treatment of chronic HBV 
infection is a major public health goal (Ganem et al., 
Annu. Hev. Biocfaea., ££: 651-693, 1987). Although an 
effective and inexpensive vaccine is available for 
20 preventing infection, to date there is no effective 
therapy for treating individuals with persistent 
infection, nor for reducing the risk of liver disease in 
infected patients (Maynard et al., Jtev. Infect. Dis., 11. 
S574-S578, 1989); DiBisceglie et al.. Cancer Detection 
25 and Prevention, ±A, 291-293, 1989). Current treatments 
for chronic HBV infection include interferon and other 
inhibitors of viral DNA synthesis, since these agents 
have achieved only limited success, additional antiviral 
approaches are urgently needed. 
30 Hepadnaviruses are composed of a viral envelope, a 

nucleocapsid which contains a relaxed circular 3.2 kb DNA 
^ genome, and a virally encoded reverse transcriptase. 

Following infection of a cell, virion DNA is delivered to 
the nucleus where it is converted into a covalently 
35 closed circular DNA (cccDNA) , which is in turn 
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transcribed into several subgenomic and pregenomic mRNAs. 
The pregenomic RNA is then encapsidated into the viral 
nucleocapsid, together with the reverse transcriptase 
enzyme necessary to generate the viral DNA genome (Enders 
5 et al., J. Virol., 67, 35-41, 1987). Selective 
encapsidation of pregenomic RNA depends on both 
nucleocapsid protein and on viral polymerase 
(Bartenschlager et al., J • Virol., 64, 5324-5332, 1990; 
Hirsch et al., Nature, 344, 552-555, 1990; Nassal, M. , J*. 
10 Virol., 66, 4107-4116, 1992; Roychoury et al*, J. Virol., 
65 . 3617-3624, 1991) as well as on a cis-acting 
encapsidation signal located at the 5' end of the 
pregenomic RNA (Bartenschlager et al., supra; Junker- 
Hiepmann et al., EMBO J., 9, 3389-3396, 1990; Pollack et 
15 al*, J. Virol., 67, 3254-3263, 1993. 

The mammalian hepadnavirus 21 kd core protein is a 
183-187 (depending on the viral strain) amino acid 
monomer, 180 of which self assemble into an icosahedral 
structure within the cytoplasm of infected cells. The 
20 core protein has two functional domains. The 

aminoterminus (amino acids 1 to 139-44) is essential for 
core assembly. A carboxyterminal arginine-rich region 
(amino acids 139-183, or 144-187, depending upon the 
viral strain) binds nucleic acids that are required for 
25 positive strand DNA synthesis, and stabilizes core 

particles for complete assembly of the complex into an 
enveloped viral particle (Birnbaum et al. J. Virol., &£, 
3319-3330, 1990; Yu et al., J. Virol., £5, 2511-2517, 
1990; Nassal, M. , supra). 

30 fiimmarv of the Invention 

The invention is based on Applicants' discovery 
that altering the carboxyterminus of the hepadnavirus 
core protein creates a mutant polypeptide that reduces 
replication of a wild type hepadnavirus, by a dominant 
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negative mechanism* The inhibitory effect is achieved by 
deletion of a few carboxyterminal amino acids from the 
core protein, and/ or by joining the core protein to a 
hepadnavirus surface protein, thereby creating a core- 
5 surface fusion polypeptide. 

Accordingly, the invention features a method of 
inhibiting the replication of a naturally-occurring, 
infectious hepadnavirus. The method involves introducing 
into the proximity of the hepadnavirus a hepadnavirus 

10 mutant polypeptide, or a nucleic acid that encodes such a 
hepadnavirus mutant polypeptide. The polypeptide 
includes a first amino acid sequence that is 
substantially identical to a region of a wild type 
hepadnavirus core protein, but lacks a second amino acid 

15 sequence of the wild type hepadnavirus core protein, 

wherein the second sequence includes the carboxyterminal 
three amino acids of the wild-type hepadnavirus core 
protein and does not exceed 100 amino acids in length. 
The mutant polypeptide is introduced into the infected 

20 cell, or is expressed from the nucleic acid, in the 

proximity of the naturally-occurring hepadnavirus, so as 
to be available to inhibit replication of the 
hepadnavirus • 

When the method of inhibiting hepadnavirus 

25 replication is targeted against HBV, the carboxyterminal 
amino acid of the first amino acid sequence can be 
selected from the group consisting of any of the amino 
acids between position 81 and position 180 of the 
sequence shown in Fig. 15 (SEQ ID NO: 12), inclusive; 

30 preferably the carboxyterminal amino acid is chosen from 
the group consisting of the amino acids between position 
171 and position 180 of the sequence shown in Fig. 15 
(SEQ ID NO: 12) , inclusive. A construct exemplified 
herein ends with a carboxyterminal residue at position 

35 171, so that the mutant core protein includes amino acids 
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1-171 (Fig. 15 (SEQ ID NO: 12))* In another example, the 
carboxyterminal amino acid is amino acid 178, so that the 
mutant core protein includes amino acids 1-178 (Fig. 15 
(SEQ ID NO: 12)), corresponding to a five amino acid 
5 deletion from the car boxy terminus (see, e.g., the 

analogous duck hepatitis B virus (DHBV) construct pBK, 
which is described below) . The first amino acid sequence 
is at least 70 amino acids in length, e.g., 72, 74, 76, 
78, or 80 amino acids in length. The aminoterminal amino 

10 acid of the first amino acid sequence can be the first 
amino acid of the corresponding wild type hepadnavirus 
sequence. Alternatively, nonessential aminoterminal 
amino acids can be eliminated from the mutant 
polypeptide, provided that the resulting mutant 

15 polypeptide does not lose substantial inhibitory activity 
as a result, when tested according to the methods 
described below. 

By M lacks a second amino acid sequence* 9 is meant 
that at least three ami no acids from the carboxyterminal 

20 end of the core protein have been deleted to make the 
mutant • Preferably, the deleted sequence includes amino 
acids 171-183 of the HBV core protein; i.e., the second 
amino acid sequence includes amino acids 171-183 of the 
sequence shown in Fig. 15 (SEQ ID NO: 12), inclusive* 

25 Xn another embodiment of the method of inhibiting 

hepadnavirus replication, the mutant polypeptide further 
includes a third amino acid sequence. The third amino 
acid sequence is substantially identical to a portion of 
a wild type hepadnavirus surface protein. The 

30 aminoterminal amino acid of the third amino acid sequence 
may be joined by a peptide bond to the carboxyterminal 
amino acid of the first amino acid sequence so as to 
create a fusion protein. The third amino acid sequence 
can be the entire surface protein, or can be a portion 

35 thereof, e.g., a portion of at least 4, 8, 20, 30, or 43 
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amino acids in length. For example, the aminoterminal 
amino acid of the third amino acid sequence can be 
selected from the group consisting of the amino acids 
between position 1 and position 112 of the sequence shown 
5 in Fig. 16 (SEQ ID NO: 14), inclusive , preferably the 
amino acids between position 1 and position 8, inclusive. 
Preferred aminot ermina 1 amino acids of the third amino 
acid sequence exemplified herein include, but are not 
limited to, position 5 or position 8 of Fig. 16 (SEQ XD 
10 NO: 14) . 

The carboxyterminal amino acid of the third amino 
acid sequence can be selected from a group that includes 
any of the amino acids between position 51 and position 
224 of Fig. 16 (SEQ ID NO: 14), inclusive; e.g., any of 

15 the amino acids between position 112 and position 224 of 
Fig. 16 (SEQ ID NO: 14), inclusive; e.g., the 
carboxyterminal amino acid may be position 51, position 
112, or position 224 of Fig. 16 (SEQ ID NO: 14). Thus, 
the portion of the surface protein included on the mutant 

20 polypeptide preferably includes surface protein residues 
1-112, 8-112, or 8-51, all inclusive (Fig. 16; SEQ ID NO: 
14). 

The use of a core protein for inhibiting viral 
replication is a species-specific event, so that mutant 

25 core proteins inhibit nucleocapsid assembly in the same 
type of hepadnavirus from which they were derived. Thus, 
the first amino acid sequence is substantially identical 
to a region of a wild type hepadnavirus core protein that 
is derived from the same type of hepadnavirus (e.g., HBV 

30 versus DHBV) as the naturally-occurring hepadnavirus 
targeted for inhibition. In contrast, the third amino 
acid sequence may be substantially identical to a portion 
of a wild type hepadnavirus surface protein of any 
hepadnavirus species, since the stir face proteins do not 

35 demonstrate species specificity. Thus, when the method 
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of the invention is used to treat an HBV infection, the 
mutant polypeptide should include sequences specifically 
derived from the HBV core protein Fig. 15 (SEQ ID NO: 
12) , but can include sequences derived from any species 
5 of surface protein (e.g. , the sequence of Fig. 16 (SEQ ID 
NO: 14)). 

In another embodiment , the invention features a 
nucleic acid that encodes a mutant hepatitis B virus 
(HBV) polypeptide, the polypeptide including a first 

10 amino acid sequence that is substantially identical to a 
region of a vild type HBV core protein, and lacking a 
second amino acid sequence of the vild type HBV core 
protein. The second sequence includes the 
car boxyt ermina 1 three amino acids of the vild type HBV 

15 core protein and does not exceed nine amino acids in 
length. Thus, the carboxyterminal amino acid of the 
first amino acid sequence can be at position 174, 
position 175, position 176, position 177, position 178, 
position 179, or position 180, all of Fig. 15 (SEQ ID NO: 

20 12). 

In another embodiment, the invention features a 
nucleic acid that encodes a mutant hepadnavirus 
polypeptide. The polypeptide includes a first amino acid 
sequence that is substantially identical to a region of a 

25 vild type hepadnavirus core protein; lacks a second amino 
acid sequence of the vild type hepadnavirus core protein 
vhich includes at least the carboxyterminal three amino 
acids of the vild type hepadnavirus core protein; and 
includes a third amino acid sequence that is 

30 substantially identical to a portion, or all, of a vild 
type hepadnavirus surface protein* The aminoterminal 
amino acid of the third amino acid sequence may be joined 
by a peptide bond to the carboxyterminal amino acid of 
the first amino acid sequence so as to create a fusion 

35 protein. The carboxyterminal amino acid of the first 
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amino acid sequence can be any of the amino acids between 
position 71 and position 180 of Fig* 15 (SEQ ID NO: 12), 
inclusive. Preferably, the second amino acid sequence 
does not exceed 100 amino acids in length. 
5 The invention also features polypeptides encoded 

by any of the various nucleic acids of the invention. A 
polypeptide of the invention can be included in a 
therapeutic composition as an active ingredient, along 
with a pharmaceutical^ acceptable carrier, or it can be 

10 expressed from the nucleic acid within the infected cell. 

The invention also features vectors into which are 
inserted any of the various nucleic acids of the 
invention. The vector can include any sequence known to 
those of skill in the art necessary or desirable for 

15 replicating the vector in a eukaryotic cell or for 

expressing a polypeptide of the invention from the coding 
sequences thereon. For example, the nucleic acid 
sequence can be operatively linked to appropriate 
transcription and/ or translation control sequences that 

20 function in a eukaryotic cell. The vector can be any 

vector suitable, for maintaining or making multiple copies 
of a nucleic acid of the invention, or can be one that is 
suitable for administering a nucleic acid of the 
invention to a cell or to a mammal infected with a 

25 hepadnavirus, e.g. , to a human patient infected with HBV 
or to cells removed from the patient for ex vivo gene 
therapy. Examples of vectors useful in the method of 
inhibiting a hepadnavirus include, but are not limited 
to, adenovirus vectors, adeno-associated vectors, and 

30 retroviral vectors. Any of the various vectors of the 
invention can be included in a therapeutic composition 
along with a pharmaceutical^ acceptable carrier. 

In another aspect the invention includes a method 
of evaluating a candidate polypeptide for its ability to 

35 inhibit the replication of a naturally-occurring 
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hepadna virus • The method involves introducing a mutant 
hepadnavirus polypeptide as described above into a medium 
in the presence of the hepadnavirus and determining 
whether hepadnavirus replication is inhibited in the 
5 presence of the polypeptide, compared to in its absence, 
such inhibition being an indication that the polypeptide 
is an inhibitor of hepadnavirus replication. By "medium" 
is meant an environment that is capable of supporting 
viral replication by virtue of its chemical composition* 

10 The medium can be within an organism, e.g. , an animal 
model, or can be within an organ removed from an animal. 
The medium can also be an intracellular medium, e.g. f in 
a cell culture assay, or a cell-free extract, e.g., a 
cell free replication system. Examples of cells suitable 

15 for a cell culture assay include, but are not limited to, 
Huh-6, Huh-7, HepG2 , HepG2 2215, IMH, DC, and HCC cells. 
The polypeptide can be introduced to the medium by 
introducing into the medium a nucleic acid encoding the 
polypeptide, with subsequent expression of the 

20 polypeptide therein. 

Another method of inhibiting the replication of a 
naturally-occurring hepadnavirus involves introducing 
into the proximity of the hepadnavirus a hepadnavirus 
mutant polypeptide, or a nucleic acid that encodes a 

25 hepadnavirus mutant polypeptide. The polypeptide 
includes a first amino acid sequence that is 
substantially identical to a region of, or all of, a wild 
type hepadnavirus core protein, and a second amino acid 
sequence which is substantially identical to a portion 

30 of, or all of, a wild type hepadnavirus surface protein. 
The aminoterminal amino acid of the second amino acid 
sequence may be joined by a peptide bond to the 
carboxyt ermina 1 amino acid of the first amino acid 
sequence so as to create a fusion protein. The second 

35 amino acid sequence can be the entire surface protein, or 
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can be a portion thereof. The mutant: polypeptide is 
expressed from the nucleic acid in the proximity of the 
naturally-occurring hepadnavirus, so as to be available 
to inhibit replication of the hepadnavirus. 



hepadnavirus mutant polypeptide, or a nucleic acid that 
encodes a hepadnavirus mutant polypeptide. The 
polypeptide includes a first amino acid sequence that is 
substantially identical to a region , or all, of a wild 

10 type hepadnavirus core protein ,. and a second- amino acid 
sequence which is substantially identical to a portion, 
or all, of a wild type hepadnavirus surface protein. The 
am i noterminal amino acid of the second amino acid 
sequence may be joined by a peptide bond to the 

15 carboxyterminal amino acid of the first amino acid 

sequence so as to create a fusion protein. The second 
amino acid sequence can be the entire surface protein, or 
can be a portion thereof. 



20 member of the hepadnavirus family of viruses, including, 
but not limited to, hepatitis B virus and hepatitis delta 
virus (Wang et al.. Nature, 323:508-13, 1986). Although 
treatment of HBV is an important feature of the method of 
invention due to the incidence of HBV-related human 

25 disease, the methods described herein also apply to other 
species of hepadnaviruses. Examples of hepadnaviruses 
within the scope of the invention include, but are not 
limited to, hepadnaviruses infecting various human 
organs, including liver cells, exocrine and endocrine 

30 cells, tubular epithelium of the kidney, spleen cells, 
leukocytes, lymphocytes, e.g., splenic, peripheral blood, 
B or T lymphocytes, and cells of the lymph nodes and 
pancreas (see, e.g.. Mason et al., Hepatology, 9:635-645, 
1989) • The invention also applies to hepadnaviruses 

35 infecting non-human mammalian species, such as 



5 



In a final aspect, the invention includes a 



As used herein, a " hepadnavirus H refers to a 
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domesticated livestock or household pets* In addition, 
the invention includes a method of evaluating a candidate 
mutant polypeptide for its ability to inhibit 
hepadnavirus replication. For the purposes of conducting 
5 a laboratory screening assay , a variety of hepadnavirus 
species are useful models. Examples include, but are not 
limited to, woodchuck hepatitis virus (WHV; Summers et 
al. Proc. Natl. Acad Sci. USA, 75:4533-37, 1978), duck 
hepatitis B virus (DHBV; Mason et al. J. Virol. 36:829- 
10 36, 1978), and squirrel hepatitis virus (e.g., Marion et 
al. Proc. Natl. Acad Sci. USA, 77:2941-45, 1980). 



below with reference to the sequence of HBV (Figs. 15 and 
16; SEQ ID NOs: 11-14), it is understood that the 

15 invention encompasses mutant polypeptides comprising 
corresponding amino acid segments derived from other 
hepadnavirus species. One of ordinary skill in the art 
can easily compare closely-related sequences to locate 
the analogous amino acid positions in related 

20 hepadna viruses ; the descriptions provided in Examples 2 
and 3 illustrate examples of such comparisons* 



replication is used to treat a hepadnaviral infection in 
an animal, a "naturally-occurring" hepadnavirus refers to 

25 a form or sequence of the virus as it exists in an 

animal, e.g., a natural isolate derived from an infected 
animal. In all other contexts, a "naturally-occurring" 
hepadnavirus is intended to be synonymous with the 
sequence known to those skilled in the art as the "wild 

30 type" sequence, e.g. , the wild type HBV core and surface 
protein sequences shown in Figs. 15 and 16 (SEQ ID NOs: 
11—14) • If an amino acid sequence of a core or surface 
protein of a hepadnavirus that is derived from a natural 
isolate differs from the conventionally accepted "wild 

35 type" sequence, it is understood that the sequence of the 



Although particular amino acids are referred to 



Where the method of inhibiting hepadnavirus 
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natural isolate may be "the proper comparison sequence for 
designing mutant polypeptides of the invention. 

"Sequence identity", as used herein, refers to the 
subunit sequence similarity between two nucleic acid or 
5 polypeptide molecules. When a given position in both of 
the two molecules is occupied by the same nucleotide or 
amino acid residue, e.g., if a given position in each of 
two polypeptides is occupied by serine, then they are 
identical at that position. The identity between two 

10 sequences is a direct function of the number of matching 
or identical positions, e.g., if half (e.g., 5 positions 
in a polymer 10 subunits in length) of the positions in 
two polypeptide sequences are identical, then the two 
sequences are 50% identical; if 90% of the positions, 

15 e.g., 9 of 10, are matched, the two sequences share 90% 
sequence identity. Methods of sequence analysis and 
alignment for the purpose of comparing the sequence 
identity of two comparison sequences are well known by 
those skilled in the art. By "substantially identical" 

20 is meant sequences that differ by no more than 10% of the 
residues, and only by conservative amino acid 
substitutions such as those shown in Table 1, or non- 
conservative amino acid substitutions, deletions, or 
insertions that do not appreciatively diminish the 

25 polypeptide's biological activity, e.g., an Insertion of 
amino acids at the junction of the core protein and 
surface protein sequences that has no appreciative effect 
on biological activity. "Biological activity", as used 
herein, refers to the ability of a mutant polypeptide to 

30 inhibit hepadnavirus replication, and can be measured by 
the assays described below. 

other terms and definitions used herein will be 
understood by those of routine skill in the art. For 
example, by "inhibiting the replication of" is meant 

35 lowering the rate or extent of replication relative to 
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replication in the absence of a mutant polypeptide of the 
invention. By "into proximity with the hepadnavirus" is 
meant introducing into a cell, organ, or organism which 
is infected with a naturally-occurring hepadnavirus, or, 
5 in the case of laboratory application, cotransf ection or 
co-inoculation with a wild type hepadnavirus. By 
"nucleic acid* is meant deoxyribonucleic acid (DNA) or 
ribonucleic acid (RNA) . 

The methods, nucleic acids, and polypeptides of 

10 the invention can be used to inhibit hepadnaviral 

replication in a mammal, e.g., as an effective therapy 
for treating individuals with a persistent HBV infection, 
or as a means of reducing the risk of hepatocellular 
carcinoma in an infected animal- Polypeptides of the 

15 invention can be administered to an infected animal 
either directly or by gene therapy techniques. The 
screening methods of the invention are simple, rapid, arid 
efficient assays designed to identify polypeptides with 
anti-hepadnaviral activity. 

20 Other features and advantages of the invention 

will be apparent from the following detailed description 
and from the claims. 

p-rief Description of th e Drawings 
Fig. 1 is a schematic illustration of the 
25 structural organization of "wild type" and mutant 
hepadnavirus constructs. 

Fig. 2 is an autoradiographic image of an agarose 
gel, showing a Southern blot analysis of core particle 
DNA that was extracted from HuH-7 cells five days post 
30 transfection and probed with full length 32 P-labeled WHV 
DNA. 

Fig. 3 is an autoradiographic image of an agarose 
gel, showing a Southern blot analysis of core particle 
associated viral DNA that was extracted from HuH-7 cells 



17:39:34 



WO 97/00698 



PCT/US96/10602 



- 13 - 

five days after transf action and probed simultaneously 
with the two full length 32 P-labeled WHV and HBV DMA 
probes. 

Fig. 4 is an autoradiographic image of an agarose 
5 gel, showing a Southern blot analysis of core particle 
associated viral DNA that was extracted from HepG2 cells 
five days after transfection and probed with full length 
32 P-labeled HBV DNA. 

Fig. 5 is an autoradiographic image of a 
10 polyacrylamide gel showing a RNase protection assay. 

Fig. 6 is an autoradiographic image of an agarose 
gel showing a Southern blot analysis of the anti-viral 
effect of dominant negative core mutants on "wild type* 
HBV replication during transient transfection in HuH-7 
15 cells. 

Fig. 7 is an autoradiographic image of a SDS- 
polyacrylamide gel showing a Western blot analysis of 
Hep62 cell lysates probed with anti-HBc antibodies. 

Fig. 8 is an autoradiographic image of an agarose 
20 gel showing a Southern blot analysis of the effect of a 
dominant negative core mutant on replication of HBV in 
Hep-G2 2215 cells. 

Fig. 9 is an autoradiographic image of an agarose 
gel showing a Southern blot analysis of cytosol— derived 
25 nucleocapsids from transf ected LMC cells hybridized with 
a full length DHBV DNA probe. 

Fig. 10 is an illustration of the nucleic acid 
sequence of the pCN4 plasmid insert (SEQ ID NO: 1) and 
the corresponding translated amino acid sequence (SEQ ID 
30 NO: 2) . 

Fig. 11 is an illustration of the nucleic acid 
sequence of the pHBV DN plasmid insert (SEQ ID NO: 3) and 
the corresponding translated amino acid sequence (SEQ ID 
NO: 4). 
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Fig. 12 is an illustration of the nucleic acid 
sequence of the pHBV DN AA plasmid insert (SEQ ID NO: 5) 
and the corresponding translated amino acid sequence (SEQ 
ID NO: 6) . 

5 Fig. 13 is an illustration of the nucleic acid 

sequence of the pHBV DN BB plasmid insert (SEQ ID NO: 7) 
and the corresponding translated amino acid sequence (SEQ 
ID NO: 8) . 

Fig- 14 is an illustration of the nucleic acid 
10 sequence of the pDHBV BK plasmid insert (SEQ ID NO: 9) 

and the corresponding translated amino acid sequence (SEQ 
ID NO: 10) . 

Fig. 15 is an illustration of the nucleic acid 
sequence of the HBV core protein (SEQ ID NO: 11) and the 
15 corresponding translated amino acid sequence (SEQ ID 
NO: 12) . 

Fig. 16 is an illustration of the nucleic acid 
sequence of the HBV core protein (SEQ ID NO: 13) and the 
corresponding translated amino acid sequence (SEQ ID 
20 NO: 14) . 

petailed D escription 
Applicants have observed that replication of a 
wild type hepadnavirus is reduced when it is co- 
transfected with a nucleic acid construct encoding a 

25 truncated core protein, or a core-surface fusion protein. 
The truncated core protein, alone or in combination with 
the surface protein component, has a deletion of at least 
three amino acids from the carboxyterminal end. Viral 
replication is reduced by as much as 90-95% without 

30 detectable toxic effects on the host cell. 

Const itutively expressing a HBV mutant core-surface 
fusion protein as a retroviral insert substantially 
inhibits HBV viral DNA production in cells that 
previously had continuously produced all viral 
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replicative intermediates and infectious virions. An 
adenoviral-based plasmid that encodes the sane mutant 
core-surface fusion protein also inhibits HBV replication 
following transient cotrans feet ion in HCC cells. These 
5 dominant negative effects on viral replication are 
consistent over a range of hepadnavirus species. 

Materials and Methods* 

Materials and methods useful for practicing the 
invention are described as follows: 

10 Plasmid Constructs. The parental plasmid pCMW82 

vas used to generate a series of constructs expressing 
WHV core proteins with an altered car boxytermina 1 region. 
Plasmid pCMW82 expresses the "wild type" WHV pregenome 
under the control of the cytomegalovirus immediate-early 

15 (CMV IE) promoter (Seeger et al., «7. Virol . , 63 . 4665- 
4669, 1989) • The pHBV plasmid carries the HBV pregenome 
under the control of the CMW XE promoter. These plasmids 
direct the synthesis of complete virions in tissue 
culture cells. The first nucleotide of the precore open 

20 reading frame was designated as nucleotide number 1 in 
the WHV genome. 

The structural organization of "wild type" and 
mutant WHV, HBV, and DHBV core plasmid constructs are 
depicted in Fig* 1. The white boxes represent the open 

25 reading frame (ORF) used for constructing core mutants. 
Numbers at the boundaries of each ORF refer to the amino 
acids in the "wild type" or mutant proteins. Dotted 
lines represent deleted sequences. Solid and hatched 
boxes correspond to mutant core proteins expressed from 

30 WHV and HBV, respectively. Shaded bars refer to DHBV. 
The shaded hatched bars refer to the polymerase gene. 
Except for the "wild type" constructs pCMW82 and pCMW- 
DHBV, all other constructs are incapable of replication 
because of deletions in genes that overlap the truncated 
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portions of the core protein. The * refers to a stop 
codon introduced by a frame shift mutation. 



complete digestion with the appropriate restriction 
5 enzyme. This was followed by subsequent incubation at 
30 °C for 20 min. in the presence of the Klenow fragment 
of DKA polymerase I and deoxyribonucleotide 
triphosphates, which filled in the 3' recessed DKA ends. 
Plasmids were then ligated with T4 DKA ligase. The 3 7 

10 protruding ends were filled in by incubation with the 
Klenow fragment of DKA polymerase I in the absence of 
deoxyribonucleotide triphosphates at 37 °C for 15 min* 
This eliminated protruding ends. Deoxyribonucleotide 
triphosphates were then added and incubation was carried 

15 out at 30°C for 20 min. (Sambrook et al., Molecular 
Cloning: A Laboratory Manual . Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, 1989) . 



encoding the core protein were obtained as follows: 1) 

20 pCKl: To make the plasmid pCNl the WHV core gene was 
digested with the restriction enzyme SstX at nucleotide 
(nt) 310, incubated with Klenow DKA polymerase, and 
reclosed with T4 DKA ligase. This introduced a frame 
shift mutation at nt. 306 in the WHV core gene, thereby 

25 creating a stop codon at nt. 317. This mutation produces 
a 74 amino acid carboxyterminal truncated core protein, 
leaving intact the rest of the viral coding regions. 2) 
pCK2: To make pCN2 the WHV parental plasmid was digested 
with the restriction enzymes Bglll (nt. 601 in the core 

30 gene) and Smal, the latter being located in the 

downstream multiple cloning site of the vector. The 
intervening viral genes were separated by gel 
electrophoresis, and the DKA ends were filled in with 
Klenow DKA polymerase and ligated with T4 DKA ligase. 

35 This WHV core gene has 12 amino acids deleted at the 



The constructs shown in Fig. 1 were produced by 



Constructs containing mutations in sequences 
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carboxyterminus and is fused to a three amino acid 
heterologous extension from the plasmid vector. 3) pCN3: 
To maJce the plasmid pCN3 , the vild type plasmid pCHW82 
was digested with the restriction enzymes Bglll and SacXX 
5 (position 2983 in the WHV x gene) , the intervening viral 
DNA fragment was removed, and ends were filled in and 
ligated. The resulting plasmid construct encodes a 171 
amino acid core protein fragment fused in-frame with the 
X protein at amino acid 31. 4) pCN4 : The plasmid pCN4 

10 was produced by a Bglll-HscI (position 1826) fragment 

excised from pCMW82 and blunted by Klenov DNA polymerase. 
The plasmid was ligated to join the WHV 171 amino acid 
core protein as an in-frame fusion protein with amino 
acid 47 of the WHV small surface protein. 5) pCN5: The 

15 plasmid pCN5 was produced- by removing the DNA fragment 
SstI (pos. 306)-BspEI (pos. 519) from pCN4 , and blunting 
the ends with Klenow DNA polymerase and T4 DNA ligase. 
This introduced a WHV core in- frame deletion between 
amino acids 74 and 145. 6) Plasmid pCN6 expresses the 

20 first 171 amino acids of the WHV core protein fused in- 
frame with the HBV small surface protein at amino acid 
51. 

The HBV numbering system designates the unique 
EcoRI site as nucleotide l. Construct pHBV DN was 

25 generated by digesting pCMW82 at nt. 601 of the rare gene 
with BgllX, and blunting the DNA end with Klenow DNA 
polymerase. A second cut was performed with Pvul in the 
ampicillin resistance gene of the carrier plasmid, and 
the Bglll-Pvul DNA fragment was removed by fractionation 

30 on an agarose gel. The HBV MscX (pos. 299) -Pvul (in the 
ampicillin resistance gene of pHBV) fragment was ligated 
to the blunted Bglll-Pvul fragment. 

In order to produce an in-frame dominant negative 
construct of HBV that was similar to the pCN4 WHV 

35 construct, the pCN6 fragment from the SnaBI site (which 
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cuts in the CMV promoter of the carrier plasmid) to the 
BspEI site (pos. 519 in the WHV) was removed and 
substituted with the same SnaBI-BspEI (pos. 2327) 
fragment from pHBV. In this way, the HBV core protein 
5 was fused in-frame to amino acid 144 of the WHV core 
protein. This fragment, derived from plasmid pCN6, was 
already fused at amino acid 171 to the HBV small surface 
protein at amino acid 51. The resulting pHBV DN 
therefore encodes, in the hinge between the deleted core 

10 and surface proteins, five amino acids derived from the 
WHV core protein ( GG ARA) . These five amino acids were 
not present in the subtype HBV core protein. The 
car boxyt ermi na 1 20 amino acid of the WHV core protein are 
conserved in HBV. 

15 Two additional plasmids were derived from pHBV and 

called pHBV AA and pHBV BB. To make pHBV DN AA, pHBV was 
partially digested with the restriction enzyme Aval (nt. 
2431) , and then partially digested with Avrll (nt. 176) ♦ 
The resulting DNA ends were blunted by adding Klenow DNA 

20 polymerase and nucleotide triphosphates. The DNA ends 
were ligated with T4 ligase. The resulting plasmid pHBV 
DN AA encodes the HBV core protein fused in frame at 
amino acid 179 with the surface protein (encoded by the 
n S gene") at amino acid 8. The plasmid pHBV BB was made 

25 by performing two sequential partial digestions with the 
enzymes BgllX and BamHI. The DNA ends were ligated with 
T4 ligase. The pHBV BB plasmid expresses the HBV core 
protein fused in frame at amino acid 175 with the surface 
protein at amino acid 112. The correct design of the 

30 constructs was confirmed by restriction digest mapping 

and DNA sequence analysis. Plasmid DNAs were purified by 
the alkali lysis procedure followed by sedimentation 
through a cesium chloride— ethidium bromide density 
gradient. As a result of these viral gene manipulations 

35 the above plasmid constructs produce replication 
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defective WHV genomes. Plasmid pCNl expresses a 
truncated core protein that is unable to assemble into 
functional nucleocapsids. All other constructs contain 
inactivating deletions in the polymerase gene. 



constructed using the poly linker of the plasmid pBS 
SK(+) (Stratagene) , which allows for viral gene 
transcription from the T7 promoter to maXe a HBV-specif ic 
276 nt antisense RNA. This species, encoded by a BamHI 
10 (pes. 2906) to EcoRI (pos. l) fragment, was used in RNase 
protection experiments. The 32 P labeled riboprobe 
annealed specifically to the "wild type" pregenomic HBV 
DNA without recognizing the pHBV DN mRNA. 



15 proteins were derived from the plasmid pCHV DHBV (Wu et 
al., a. Virol., 65 . 2155-2163, 1991), which expresses the 
DHBV pregenome under the control of the CMV promoter. 
Construct pSK contains a deletion between the SphX site 
(position 2843 in the core gene; this numbering system is 

20 arbitrarily initiated with the nucleotide GAATT£ of the 
unique EcoRI site) and the KpnX site (position 1290, in 
the S gene) . The intervening fragment was separated by 
agarose gel electrophoresis. The ends of the larger DNA 
fragment were blunted by Klenow DNA polymerase and 

25 religated* This construct expresses, under the control 
of the CMV promoter, a protein composed of the first 66 
amino acids of the DHBV core protein fused in frame to 
amino acid five of the DHBV surface protein. Construct 
pBK contains a deletion between the Bglll site (position 

30 391 in the core gene) and the KpnX site (position 1290 in 
the S gene) . The intervening fragment was separated by 
agarose gel electrophoresis and the ends of the larger 
DNA fragment were filled in and blunted by the Klenow DNA 
polymerase. The ends were then religated. The resulting 

35 construct expresses, under the control of the CMV 



5 



Another plasmid, designated pRHBBE, was 



Constructs expressing DHBV dominant negative 
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promoter, a protein composed of the first 257 amino acids 
of the DHBV core protein (five amino acids are missing 
from the carboxyterminus) , fused in frame to the fifth 
amino acid of the DHBV surface protein. 
5 To make the construct pK, the pCHV DHBV was 

linearized by cutting at the Kpnl site (position 1290 of 
the S gene) • The DNA ends were blunted with the Klenow 
DNA polymerase reaction and the fragment was religated. 
The resulting construct has a frame-shift mutation so 

10 that the DHBV polymerase pK gene and the pre— S and S 
genes have a termination site a few nucleotides 
downstream from the KpnX site. The construct pK thus 
expresses, under the control of the CMV promoter, the 
full length core protein, but none of the envelope 

15 proteins apart from a truncated pre-S protein. A 

frameshift mutation that occurs in the polymerase gene 
renders the other constructs carrying the deletions 
described above replication defective. Construct pNX 
contains a deletion between the Nsil site (position 2845 

20 in the core gene) and the XhoX site (position 1212 in the 
pre-S gene) • The intervening fragment was separated by 
agarose gel electrophoresis. The ends of the larger DNA 
fragment were blunted and filled in with Klenow DNA 
polymerase, followed by religation of the fragment to 

25 itself. This construct expresses, under the control of 
the CMV promoter, the first 68 amino acids of the DHBV 
core protein fused in frame to amino acid 437 of the 
carboxyterminus of the polymerase gene. 

Retroviral constructs: The HBV core-surface 

30 fusion gene encoded by pHBV DN was PCR amplified with 
oligonucleotides containing at their 5' ends a SalX 
restriction enzyme recognition site. The antisense 
primer contained a recognizable Flag epitope (Kodak) . 
The PCR product was gel purified, digested with SalX, and 

35 cloned in the retroviral pBabe Puro vector (Morgenstem 
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et al., Nucl. Acids Res., 18:3587-96, 1990) at its Sail 
site. The design of the resulting pBP HBV DN vector was 
confirmed by sequence analysis. 

Transf actions into Hepatoma Cell Lines: Human 
5 hepatoma cells HuH-7 and HepG2 support a complete viral 
replicative cycle following transfection with a plasmid 
construct expressing the pregenomic viral UNA (Mason et 
al., Bepatology, &# 635-45, 1989). Cells were maintained 
and passaged as previously described (Wu et al. , «7. 

10 Virol. , £5, 2155-2163, 1991). Cells were transiently co- 
transf ected with plasmids expressing the mutated WHV or 
HBV core genes (described above) , together with an equal 
amount of a "wild type 11 WHV or HBV producing plasmid. 
Co-transfections were performed by the calcium phosphate 

15 technique (CaP0 4 transfection Kit , 5'-3', Boulder, 

Colorado). Briefly, 1.2 x 10 7 cells in 1O0 mm plates 
were grown for 24 hours. The medium was changed 2-4 
hours before transfecting with 10 fig of "wild type" 
virus. This produces the plasmid along with 10 ftg of 

20 each mutant construct. The precipitate was left' on the 
cells for 6-8 hours, and then the medium was replaced. 
The cells were harvested two days after transfection when 
performing RKA experiments, and five days after 
transfection when performing DNA experiments. 

25 The cell line LMH, derived from a chicken 

hepatocellular carcinoma, was used for transfection of 
the DHBV derived plasmids. This cell line supports 
higher levels of DHBV replication than do cell lines of 
human origin. Another cell line, derived from LMH and 

30 named D2, was created by stably transfecting a head-to- 
tail DHBV dimer that produces infectious DHBV particles. 
These cells were grown in DHEH and 10% FCS and 
trans f ected with the various dominant negative core 
mutant constructs as described above. 
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Infection of the HepG2 2215 Cell Line: Infection 
of the Hep62 2215 cell line by recombinant retroviruses 
was accomplished following a standard protocol for 
producing retroviral stocks and for infecting tissue 
5 culture cells (Miller et al., BlotechnlquGS , 7., 980-990, 
1989; Miller, et al. , Methods in Enzvmoloav. 217. 581, 
1993) . After infection, the cells were selected with 2 
pg/ml puromycin (Sigma) . Resistant clones were pooled 
and further expanded. 

10 Analysis of viral DMA replication. WHV and HBV 

DNA replication were assayed by Southern blot analysis of 
DNA that had been extracted from intracellular core 
particles. The procedure for isolation of core particles 
was previously described in detail (Pugh et al., J". 

15 Virol*, £2., 3513-3516, 1988). DNA fractionation on 

agarose gels was performed under alkali conditions and 
the DNA was transferred onto a nylon membrane for 
Southern blot analysis (Hybond N, Amersham International, 
Little Chalf ont, UK) . Prehybridization and hybridization 

20 reactions were carried out at 65 °C in 6X SSC solution (IX 
SSC is 150 mM NaCl, 15 mM Na 3 Citrate) , 5X Denhardt's 
solution (100X is 2% w/v BSA, 2% w/v Ficoll, 2% w/v 
polyvinyl pirollidone) , and 0.5% SDS. WHV and HBV DMAs 
were detected by hybridization with randomly primed 32 p- 

25 labeled full length WHV or HBV DNA (Multiprime DNA 

Labelling system, Amersham) • The membranes were washed 
twice for 15 min. each at 65 °c in IX SSC, 0.1% SDS, and 
were then washed once more at 65°C in 0.1X SSC, 0.1% SDS. 
The nylon membranes were then autoradiographed at -70 °C, 

30 using intensifying screens and Kodak films. Signal 
intensities on the nylon sheets were quantitated by a 
computer assisted scanning system (Ambis Quantprobe 
System version 3.0). 

Extraction and analysis of viral RNA. Total RNA 

35 was extracted from a 100 mm dish by lysis of cells in 1 
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ml of solution D (4 M guanldinium thiocyanate, 25 mM 
NaCitrate, pH 7.o, 0.5% sarcosyl, o.l M 2- 
mercaptoethanol) as described (Chomczynski et al.. Anal. 
Mochem., 168, 156-159, 1987). Encapsidated viral RNA was 
5 extracted from core particles by lysis in 200 ml of 

solution D and the volumes were adjusted accordingly as 
described (Roychoury et al., supra). Finally, to exclude 
contamination by plasmid DNA or reversed transcribed HBV 
DNA, the encapsidated viral RNA was digested with 16 U 
10 RNase-free DNase RQl DNase (Pr omega, Madison, Wl) at 37 *c 
for 15 ain., followed by phenol-chloroform extraction and 
ethanol precipitation, before undergoing the RNase 
protection assay. 

RNase protection analysis of total and 
15 encapsidated viral RNA was performed with a commercially 
available kit according to the manufacturer's 
instructions (RPA XX-Ribonuclease protection kit, Ambion 
Inc. Austin, TX) . The RNA probe was derived from the 
plasmid pRHBBE, a derivative of the pBluescript SK(+) , 
20 which includes the 280 bp HBV fragment BamHI (pos. nt 

2901)-EcoRX (pos. nt 1), oriented to produce an antisense 
RNA molecule when transcription was initiated with the 
bacteriophage T7 RNA polymerase. The RNA probe contained 
approximately 50 nt of plasmid sequences that were not 
25 protected by the HBV specific RNA. Labeled RNA was 

synthesized as follows: 0.5 /xg of pRHBBE was cut by r^™ctt 
and then transcribed by T7 RNA polymerase (Pr omega, 
Madison, WI) in the presence of a- 32 p dtp (100 jiCi at 400 
Ci/mM, New England Nuclear, Boston, HA) . The antisense 
30 RNA probe recognized pregenomic RNA and the 2.4 pre-si 
mRNA derived from "wild type 11 HBV, but did not recognize 
transcripts derived from pHBV DN. Hybridization, after 
denaturation at 95°C for 3 min., was performed in 20 j*l 
on 2 jig of total RNA or encapsidated pregenomic RNA 
35 derived from half of a 100 mm plate at 42 °C overnight in 
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a solution of 80% formamide, 100 mM NaCitxate pH 6.4, 300 
mM KaAcetate pH 6.4, and UnM EDTA. RNase digestion was 
carried out with RNase A (0.5U) and RNase Tl (20 U) at 
37»C for 30 min. Fragments protected by RNase digestion 
5 were separated on a denaturing 6* polyacrylamide gel 
(Sequagel 6%, National Diagnostics, Atlanta, GA) . 

Viral nucleoeapsid isolation and western slots. 
HepG2 cells that had been transfected with pHBV alone, 
pHBV DN together, or pHBV DN alone were lysed in 500 ml 
10 TNE, 1% NP 40. The debris was pelleted by centrifugation 
at 10,000 rpm in an Eppendorf bench top centrifuge. A 
200 /il aliquot of the clarified cell lysate was 
ultracentrifuged at 500,000 xg for 1 hour at 4«C through 
2 ml of a 20% w/v sucrose/TNE cushion using a TLA 100 
15 rotor (Beckman Instruments, Palo Alto, CA) . Under these 
conditions viral core particles were pelleted, whereas 
free core protein and soluble hepatitis Be antigen 
(HBeAg) remained in the supernatant (Zhou et al., supra). 
The pellet was resuspended in 100 pi of I^emmli sample 
20 buffer and boiled for 3 min. One-half of the sample was 
run over a 12.5% SDS-PAGE gel (Acrygel National 
Diagnostics, Atlanta, GA) . Western blotting was 
performed on an Immobilon-P membrane (Millipore Co., 
Bedford, MA) (Harlow et al.. Antibodies fl laboratory 
25 manual, cold Spring Harbor Laboratories, CSH, NY 1988) . 
After transfer the membrane was blocked for one hour in a 
solution of 5% non-fat dry milk and 0.5% Tween-20 in 
phosphate buffered saline (PBS) . HBcAg antigenicity was 
detected by incubation of the membrane with polyclonal 
30 antibodies prepared in rabbits against recombinant HBcAg 
(Dake Co., Carp inter ia, CA) at a 1:250 dilution in the 
above solution for one hour at 20»C. The filter was 
washed at 20°C in PBS, 0.5% Tween-20 with three 
successive changes of solution. Bound antibody was 
35 detected using the chemiluminescence method (ECL, 
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Amersham International, Little Chalfont, UK) using 
peroxidase labeled goat anti-rabbit antibodies. The 
filter vas exposed to Kodak films for 5*20 seconds. 

Experimental Results 
5 Inhibition of WHV DNA synthesis. WHV core mutant 

plasmids were tested- for the ability to inhibit wild type 
WHV DNA replication in HuH-7 cells. Fig. 2 shows a 
Southern blot analysis of core particle DNA that vas 
extracted from HuH-7 cells five days post transf ection 

10 and probed with full length 32 P labeled WHV DNA. Lane M 
contains 32 P 5' end labeled lambda Hindlll molecular 
weight markers. The HuH-7 cells were transf ected with: 
lane 1, pCMW82; lane 2, pCMW82 and pCN4; lane 3, pCMW82 
and pCNl; lane 4, pCMW82 and pCN2; lane 5, pCMW82 and 

15 pCN3; and lane 6, pCMW82 and pCN5. Each lane was loaded 
with one-half of the core associated viral DNA, which had 
been extracted from a 100 mm tissue culture dish of HuH-7 
cells • 

All mutant WHV core constructs suppressed "wild 
20 type* 1 WHV DNA synthesis, albeit with different 

efficiencies. The extent of inhibition varied among the 
different constructs, depending in part on the molecular 
structure of the mutant core protein. In order to 
exclude experimental variability, all transf ections were 
25 repeated several times with comparable results. The data 
represent an average of at least three independent 
experiments, cotransf ection of "wild type" pCKW82 with 
the mutant core constructs pCNl, pCN2, and pCN3 produced 
a modest inhibition of "wild type" viral DNA replication 
30 (36%, 48%, and 12%, respectively). In contrast, pCN4 and 
pCN5 substantially inhibited WHV DNA synthesis in HuH-7 
cells by 90% and 85%, respectively (Fig. 2). 

To test whether the pCN4 construct inhibits HBV 
replication, cotransf ection experiments were performed 
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with "wild type 11 pHBV. There was no reduction of HBV 
synthesis by the WHV based construct pCN4. Fig* 3 shows 
a Southern blot analysis of core particle associated 
viral DNA extracted from HuH-7 cells five days after 
5 transfection. The blots were probed simultaneously with 
full length 32 P labeled WHV and HBV DNA probes. Lane M 
contains 32 P 5' end labeled lambda Hindlll molecular 
weight markers. Core particle associated viral DNA was 
extracted from cells transfected with: lane 1, pCMW82; 

10 lane 2, pCMW82 and pCN4; lane 3, pCMW82 and pCN6; lane 4, 
pHBV; lane 5, pHBV and CN4; and lane 6, pHBV and pCN6. 
Each lane was loaded with one-half of the core particle 
associated DNA that had been extracted from a 100 mm 
tissue culture dish of HuH-7 cells. 

15 Xn order to determine the general region of the 

fusion protein that was responsible for . inhibiting viral 
replication, a chimeric construct expressing WHV core-HBV 
small surface fusion protein was produced. This plasmid, 
designated pCN6, reduced "wild type" WHV replication by 

20 85%, an inhibitory effect comparable to the original 
parental construct pCN4. Like pCN4, pCN6 does not 
inhibit HBV replication (Fig 3, lane 6) . It was 
concluded that the WHV core-small surface fusion protein 
encoded by pCN4 exerts a species-specific inhibitory 

25 effect. 



interfere effectively with WHV replication, HuH-7 cells 
were co-transf ected at various ratios of CMW82 to pCN4 
using 10 of pCMW82 . The total amount of transfected 
30 DNA was kept constant (20 /tg) by adding unrelated 
sonicated salmon sperm DNA. The results of these 
experiments indicate that when pCN4 was diluted by 10 and 
50 fold, there was still a 66% and 20% inhibition of 
"wild type" WHV replication, respectively. Interference 



To determine the amount of pCN4 required to 



17:39:34 





WO 97/00698 



PCT/US96/10602 



27 



with viral replication occurs even in the presence of an 
excess of "wild type* core protein* 



toxic to SCC cells* To insure that the mutant plasmids 
5 were neither affecting the efficiency of DNA uptake by 
HuH-7 cells during transf ection , nor inducing a 
cytopathic effect, each 100 mm plate had a 10 mm cover 
slip containing cells grown under the same conditions. 
The cells were investigated by imraunocyt ochemi s try 

10 utilizing the protocol of Jilbert et al. (*7. Virol. , 66, 
1377-1378, 1992) . Core protein expression was detected 
with polyclonal antibodies prepared against either WHV or 
HBV recombinant core proteins. Approximately one percent 
of the HuH-7 cells were transf ected with the ••wild type 1 * 

15 WHV plasmid, as demonstrated by diffuse cytoplasmic 
staining for WHcAg in cells harvested five days post 
transf ection. After transf ection of cells with pCK4 
alone, a punctate distribution of WHcAg in the 
perinuclear region was observed. The same staining 

20 pattern was obtained when the dominant negative core 
mutant constructs were co-trans f ected with ••wild type* 
pGMW82. The total number of HBcAg positive cells did not 
vary under these conditions. The mutant core expressing 
plasmids did not inhibit "wild type" viral DNA uptaXe 

25 during the transf ection process and were not toxic to 
HuH-7 cells. 



that the inhibitory effect exerted by pCN4 on WHV 
replication was the result of decreased transcription of 

30 "wild type " WHV pregenomic RNA. For these studies, 

Poly (A) *RNA was extracted from HuH-7 cells that had been 
transf ected with the plasmids pCMW82 alone, pGMW82 and 
pCN4 together, or pCN4 alone. The UNA was probed with a 
Bglll-BstXI WHV DNA fragment that specifically recognized 

35 the pregenomic RNA but not the pCN4 transcripts. The 



Dominant negative core mutant polypeptides are not 



It was also necessary to exclude the possibility 
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results demonstrated no change in the level of "wild 
type" WHV pregenomic KNA transcription from pCHW82 in the 
presence of pCN4. 

Inhibition of HBV replication* Based on the 
5 previous results, it was of interest to determine whether 
a similar mutant core polypeptide would inhibit HBV 
replication in HCC cells. The construct pHBV DN was 
designed to be the molecular HBV-derived equivalent of 
pCN4. Plasmid pHBV DN was co-transfected with "wild 

10 type" pHBV into HuH-7 and HepG2 cells. It inhibited 
"wild type" HBV DNA. replication by 90% (Fig. 4). 

Fig. 4 shows a Southern blot analysis of core 
particle associated viral DNA extracted from Hep62 cells 
five days after trans feet ion. The blot was probed with 

15 full length 32 P labeled HBV DNA. Lane M contains 32 P 5' 
end labeled lambda HindXIX molecular weight markers. 
Lane 1 contains 3.2 kb linear HBV DNA (10 pg) . The 
remaining lanes show core particle associated viral DNA 
extracted from cells transfected with pHBV (lane 2); or 

20 PHBV and pHBV DN (lane 3). 

The constructs pHBV DN AA and pHBV DN BB were 
assayed in the same manner, for the purpose of mapping 
which regions of the core protein and of the surface 
protein were necessary for inhibition. The construct 

25 pHBV DN AA was at least as potent an inhibitor as pHBV 
DN r whereas pHBV DN BB was less inhibitory than pHBV DN. 
This is shown in Fig. 5, which is a Southern blot 
analysis illustrating the antiviral effects of the pHBV 
DN AA and pHBV DN BB dominant negative core mutants on 

30 "wild type" HBV replication during transient transaction 
experiments in HuH-7 cells. The pCHV-HBV lane shows the 
level of "wild type" HBV replication in HUH-7 cells. The 
dominant negative mutant pHBV-DN reduced wild type 
replication by 95*. When this construct was placed in a 

35 vector containing the adenovirus sequences necessary for 
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producing a recombinant; adenovirus vector (Ad HBV DN) , 
there was an 80% decrease in HBV replication. When the 
HBV DN construct was placed in a retroviral vector (e.g., 
pBP HBV DN) , there was a 90-95% reduction in HBV 
5 replication • 

Experiments were then performed to assess the 
presence and amount of pregenomic UNA within 
nucleocapsids, and to compare these results to the level 
of viral RNA present in the cytosolic fraction by means 

10 of a sensitive RNase protection assay (Fig. 6) . una was 
extracted from HepG2 transfected cells and probed with a 
32 P labeled 322 nt RNA probe containing the BamHI (pos. 
2906)-EcoRI (pos. 1) fragment (lane P) , or 
electrophoresed on a 6% denaturing PAGE gel after RNase A 

15 and Tl digestion. Lane 1 contains 2 fig of total RNA from 
HepG2 cells transfected with pHBV; lane 2 contains 2 fig 
of total RNA from Hep62 cells transfected with pHBV and 
pHBV DN; lane 3 contains 2 fig of total RNA from HepG2 
cells transfected with pHBV DN alone (the BamHX— EcoRX 

20 fragment is missing in this construct) . The remaining 

lanes show RNA that was extracted from HepG2 -derived core 
particles and then probed as in lanes 1-3 . Each lane was 
loaded with half of the core associated RNA extracted 
from a 10 cm dish. Lane 4 contains core particle 

25 associated RNA from cells transfected with pHBV. Lane 5 
contains core particle associated RNA from cells 
transfected with pHBV and pHBV DN. Lane 6 contains core 
particle associated RNA from cells transfected with pHBV 
DN alone. There was a 90% reduction in encapsidation of 

30 "wild type* 9 pregenomic RNA when pHBV DN was co- 
transf ected with the wild type HBV DNA expressing 
plasmid r whereas no significant reduction in viral RNA 
was observed in experiments performed with total cellular 
RNA. The riboprobe used in this experiment protects 
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pregenomic and pre-Sl transcripts, both of which were 
absent in the pHBV DK transfected cells. 

"Wild type" pregenomic viral RNA is incapable of 
being encapsidated in the presence of mutant core protein 
5 because of inefficient core particle assembly. Cell 
lysates derived from HepG2 cells previously transfected 
with pHBV alone, pHBV and pHBV DK together , and pHBV DN 
alone were sedimented on a 20% w/v sucrose cushion for 
one hour at 500,000 g. Under these experimental 

10 conditions non-part iculate core protein and HBeAg were 
left in solution (Zhou et al., supra). The pellet was 
analyzed for core protein by 12.5% SDS-PAGE 
electrophoresis, and analyzed on a Western blot using 
polyclonal anti-HBc antibodies as probes (Fig. 7) . The 

15 viral core particles were derived from: lane 1, cells 
transfected with pHBV; lane 2, cells transfected with 
pHBV and pHBV DN; lane 3, cells transfected with pHBV DN 
alone; lane 4, HepG2 2215 cells (positive control). 
Lane 5 contains 100 fxg of cell lysate in RXPA buffer not 

20 subjected to ultracentrif ugation and extracted from HepG2 
2215 cells to show enrichment of core particles by the 
pelleting technique (positive control) . The protein in 
lane 6 was derived from the pellets of untransf ected 
Hep62 cells (negative control). A protein band of 21.5 

25 )cd, corresponding to the intact "wild type" HBV core 
protein, was detected only in the pellet derived from 
HepG2 cells transfected with pHBV. In the pellet of 
cells transfected with the pHBV DK plasmid, an 
immunoreactive core protein band of 11.5 kd was detected. 

30 This protein was substantially smaller than the predicted 
size of the full length core-surface fusion protein 
derived from the pHBV DN (about 3d kd) . 

To determine whether the HBV core dominant 
negative mutant HBV DN can make hepatoma cell lines 

35 resistant to HBV replication, the HBV DN coding sequence 
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was cloned into the retroviral vector pBabe Puro (pBP) , 
which contains a puromycin selectable marker* The 
resulting vector is named pBPHBV DN. Recombinant 
retroviral stocks were obtained after transf ecting pBPHBV 
5 DN into the packaging cell line PA317. The stocks were 
then used to infect KepG2 and HepG2 2215 cell lines. The 
HepG2 2215 cells constitutive ly produce wild type HBV 
virions due to the stable integration of a head to tail 
dimer of HBV. Pools of stably transduced clones were 

10 grown in the presence of puromycin. HBV DNA was purified 
from the core particles and analyzed by Southern blot* 
HepG2 2215 transduced by the pBP HBV DN vector showed a 
90% reduction in HBV replication when compared to Hep62 
2215 cells transduced by the pBP vector (Fig. 8) . This 

15 result demonstrates a striking reduction of HBV 

replicative intermediates in core particles, even in a 
cell line that constitutively expresses all the viral 
gene products and replicative forms of the virus. 

The Flag tagged dominant negative form of the HBV 

20 DN sequence was also cloned into the adenoviral vector 
pAdBgllX to generate the vector pAdHBV DN. This vector 
contains a multiple cloning site flanked by the GMV EX 
promoter and by adenovirus 5 sequences. The adenovirus 5 
sequences allow homologous recombination and 

25 reconstitution of a recombinant replication incompetent 
adenovirus after cotransf ection in 293 cells (Graham et 
al., the Human press. Vol 7 . 109-128, 1991). The plasmid 
pAdHBV DN was then introduced, along with pHBV, into HCC 
cells by transient transf ection, inhibiting HBV 

30 replication by 80% (Fig. 5),. Adenoviral vectors such as 
pAd HBV DN can be used to generate a replication 
incompetent adenovirus by homologous recombination, and 
can express the HBV DN polypeptide in the liver. 

Inhibition of DHBV replication Substantial 

35 suppression of DHBV replication was obtained by co- 
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transfecting pCMV DHBV with, the plasmid pBK. The pBK 
plasmid encodes a DHBV core protein which lacks the last 
carboxyterminal five amino acids, fused to a surface 
protein which lacks the aminoterminal first four amino 
5 acids. When shorter core fragments were fused in frame 
to a surface protein lacking the aminoterminal first four 
amino acids (plasmid pSK) , to the Pol gene product (pNX) , 
or to the pre-s gene product (pSK) , there was little or 
' no effect on DHBV replication. This result indicated 

10 that both the core protein and the surface protein 
extension were important for exerting an inhibitory 
effect on "wild type" DHBV replication, presumably by 
disrupting nucleocapsid assembly. The core portion of 
the chimeric mutant polypeptide interacts with the wild 

15 type core protein, preventing formation of intact 
nucleocapsids and thus encapsidation of the DHBV 
pregenome. A construct that expressed only the DHBV core 
protein (pK) was incapable of inhibiting DHBV 
replication, while a plasmid that expressed the same core 

20 portion as the pBK plasmid but fused to the polymerase 
gene was incapable of inhibiting "wild type" DHBV 
replication. Fig. 9 is a Southern Blot analysis of 
cytosolic derived nucleocapsid DNA from transf ected EMC 
cells, hybridized to a full length DHBV DNA probe. LMC 

25 cells were transf ected with 10 pg of pCMV DHBV together 
with 10 fig of mutant plasmids pSK (lane 2), pBK (lane 3), 
pSK (lane 4, the same as lane 2), pK (lane 5), or pNX 
(lane 6) . The last lane contains the cytosolic derived 
nucleocapsid DNA from a LMC cell line stably transfected 

30 with a head-to-tail DHBV dimer as a positive control* 
Replication of "wild type* 1 DHBV was inhibited by the 
dominant negative core mutant construct BK. 
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Therapeutic Use 

The mutant polypeptides of the invention can be 
provided exogenous ly to a target cell of an animal 
suspected of harboring a hepadnavirus infection by any 
5 appropriate method, for example by oral, parenteral, 

transdermal, or transmucosal administration. The mutant 
polypeptide can be administered in a sustained release 
formulation using a biodegradable biocompatible polymer, 
or by on-site delivery using micelles, gels or liposomes. 
10 Therapeutic doses can be, but are not necessarily, in the 
range of 0*01 - 100*0 mg/kg body weight, or a range that 
is clinically determined to be appropriate by those 
skilled in the art. 



15 invention, or as candidate agents in a method of the 

invention, can be purified using conventional methods of 
protein isolation known to one skilled in the art. These 
methods include, but are not limited to, precipitation, 
chromatography, immunoadsorption, or affinity techniques 

20 (see, e.g.. Scopes, R. protein Purification; PyApejU^es 
and Practice . 1982 Springer Verlag, NY) * The polypeptide 
can be purified from starting material that is derived 
from a genetically engineered cell line. One useful 
method of purification involves expressing the 

25 polypeptide as a fusion protein encoded by a glutathione- 
s-transferase vector, purifying the resulting fusion 
protein by GST— GSH affinity chromatography, and removing 
the GST portion of the fusion polypeptide by thrombin 
cleavage. Alternatively, a synthetic mutant polypeptide 

30 can be prepared by automated peptide synthesis (see, 

e.g., Ausubel et al., eds. Current Protocols in Mo lecular 
Bioloov . John Wiley & Sons, publ. NY. 1987, 1989; 

Sambrook et al. (1989), Molecular Cloning; A Laboratory 

Manual (2d fed.), CSH Press). 



The polypeptides useful in a method of the 
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Therapeutic administration of a mutant polypeptide 
can also be accomplished using gene therapy techniques. 
A nucleic acid that included a promoter operatively 
linked to a sequence encoding a polypeptide of the 
5 invention is used to generate high-level expression of 
the polypeptide in cells transfected with the nucleic 
acid. Gene transfer can be performed ex vivo or in vivo. 
To administer the nucleic acid ex vivo, cells can be 
removed from the body of the patient, transfected with 

10 the nucleic acid encoding the mutant polypeptide, and 
returned to the patient's body. Alternatively the 
nucleic acid can be administered in vivo, by transfecting 
the nucleic acid into target cells (e.g., hepatocytes) so 
that the mutant polypeptide is expressed in situ. 

15 The nucleic acid molecule is contained within a 

non-replicating linear or circular DNA or RNA molecule, 
or within an autonomously replicating plasmid or viral 
vector, or may be integrated into the host genome. Any 
vector that can trans feet a cell can be used in the 

20 methods of the invention. Preferred vectors are' viral 
vectors, including those derived from replication- 
defective hepatitis virus (e.g., HBV and HCV) , retrovirus 
(see, e.g., WO89/07136; Rosenberg et al., N. Eng. J. Med. 
323(9) : 570-578, 1990; Miller et al., 1993 supra), 

25 adenovirus (see, e.g., Morsey et al., J. Cell." Biochem. , 
Supp. 17E, 1993; Graham et al., in Murray, ed. , Methods 
in Molecular Biology; Gene Transfer and Expression 
Protocols. Vol. 7, Clifton, NJ: the Human Press 1991: 
109-128), adeno-associated virus (Kotin et al., Proc. 

30 Natl. Acad. Sci. USA 87:2211-2215, 1990), replication 

defective herpes simplex virus (HSV; Lu et al. , Abstract, 
page 66, Abstracts of the Meeting on Gene Therapy, Sept. 
22-26, 1992, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York) , and any modified versions of these 

35 vectors. Other preferred viral vectors include those 
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modified to target a specific cell type (see, e.g., Xan 
et al. WO 93/25234; Kasahara et al. Science, 266:1373-76, 
1994; Doraburg et al. WO 94/12626; Russell et al. WO 
94/06920) • Methods for constructing expression vectors 
5 are veil known in the art (see, e.g. , Molecular Clonings 
A Laboratory Manual, Sambrook et al., eds . , Cold Spring 
Harbor Laboratory, 2nd Edition, Cold Spring Harbor, New 
York, 1989) . 

Appropriate regulatory sequences can be inserted 

10 into the vectors of the invention using methods known to 
those skilled in the art, e.g. , by homologous 
recombination (Graham et al., jr. Gen. Virol . 36; 59-72, 
1977), or by other appropriate methods (Sambrook et al., 
eds., supra). Promoters are inserted into the vectors so 

15 that they are operatively linked 5' to the nucleic acid 
sequence encoding the mutant polypeptide. Any promoter 
that is able to initiate transcription in a target cell 
can be used in the invention. " For example, non-tissue 
specific promoters, such as the cytomegalovirus 

20 (DeBemardi et al., Proc. Natl. Acad. sci. USA 86:9257- 
9261, 1991, and references therein) , mouse 
metallothioxiine I gene (Hammer, et al., J. Mol. Appl. 
Gen. 1:273-288, 1982), HSV thymidine kinase (McKnight, 
Cell, 31:355-365, 1982), and SV40 early (Benoist et al., 

25 Nature, 290:304-310, 1981) promoters may be used. 
Preferred promoters for use in the invention are 
hepatocyte— specif ic promoters, the use of which ensures 
that the mutant polypeptides are expressed primarily in 
hepatocytes. Preferred hepatocyte-specif ic promoters 

30 include, but are not limited to the albumin, alpha- 
fetoprotein, alpha-l-antitrypsin, retinol-binding 
protein, and asialoglycoprotein receptor promoters. 
Additional viral promoters and enhancers, such as those 
from herpes simplex virus (types I and II), hepatitis 

35 virus (Types A, B, and C) , and Rous sarcoma virus (RSV; 
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Fang et al., Hepatology 10:781-787, 1989), can also be 
used in the invention. 

The mutant polypeptides of the invention, and the 
recombinant vectors containing nucleic acid sequences 
5 encoding them, may be used in therapeutic compositions 
for preventing or treating HBV infection. The 
therapeutic compositions of the invention may be used 
alone or in admixture, or in chemical combination, with 
one or more materials, including other mutant 

10 polypeptides or recombinant vectors, materials that 

increase the biological stability of the oligonucleotides 
or the recombinant vectors, or materials that increase 
the ability of the therapeutic compositions to penetrate 
hepatocytes selectively. The therapeutic compositions of 

15 the invention can be administered in pharmaceutically 
acceptable carriers (e.g. , physiological saline) , which 
are selected on the basis of the mode and route of 
administration, and standard pharmaceutical practice. 
Suitable pharmaceutical carriers, as well as 

20 pharmaceutical necessities for use in pharmaceutical 

formulations, are described in Remington 's Pharmaceutical 
Sciences, a standard reference text in this field. 

The therapeutic compositions of the invention can 
be administered in dosages determined to be appropriate 

25 by one skilled in the art. An appropriate dosage is one 
which effects a reduction in a disease caused by HBV 
infection. It is expected that the dosages will vary, 
depending upon the pharmacokinetic and pharmacodynamic 
characteristics of the particular agent, and its mode and 

30 route of administration, as well as the age, weight, and 
health (including renal and hepatic function) of the 
recipient; the nature and extent of the disease; the 
frequency and duration of the treatment; the type of, if 
any, concurrent therapy; and the desired effect. It is 

35 expected that a useful dosage contains between about 0.1 
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to 100 mg of active ingredient per kilogram of body 
weight* Ordinarily a dosage of 0.5 to 50 mg, and 
preferably, l to 10 mg of active ingredient per kilogram 
of body weight per day given in divided doses, or in 
5 sustained release form, is appropriate. 

The therapeutic compositions of the invention may 
be administered to a patient by any appropriate mode, 
e.g., parenteral ly , as determined by one skilled in the 
art. Alternatively, it may by necessary to administer 

10 the treatment surgically to the target tissue. The 

treatments of the invention may be repeated as needed, as 
determined by one skilled in the art. 

The invention also includes any other methods 
which accomplish in vivo transfer of nucleic acids into 

15 target cells. For example, the nucleic acids may be 
packaged into liposomes, non-viral nucleic acid-based 
vectors, erythrocyte ghosts, or microspheres 
(microparticles; see, e.g., U.S. Patent No. 4,789,734; 
U.S. Patent No. 4,925,673; U.S. Patent No. 3,625,214; 

20 Gregor iadis , Drug Carriers in Biology and Medicine, pp. 
287-341 (Academic Press, 1979)). Further, delivery of 
mutant polypeptides be accomplished by direct injection 
of their nucleic acid coding sequences into target 
tissues, for example, in a calcium phosphate precipitate 

25 or coupled with lipids, or as "naked DNA" . 

Mutant core polypeptides and core-surface fusion 
proteins of the invention can be tested for their ability 
to inhibit hepadnavirus replication in an animal model. 
For example, candidate polypeptides can be injected into 

30 an animal that is infected with a hepadnavirus, e.g., a 
woodchuck, duck, or ground squirrel harboring its 
respective hepatitis B virus variants (see, e.g. , Mason 
et al., «7. Virol. 36:829, 1980; Schodel et al., in 
Molecular Biology of hepatitis B virus. CRC press, Boca 

35 Raton, p. 53-80, 1991; Summers et al., Proc. Natl* Acad. 
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Sci. USA, 75:4533-4537, 1978). Candidate polypeptides 
can also be analyzed in transgenic animal strains 
developed for the purpose of studying hepadnaviral gene 
expression (see, e.g., Babinet et al.. Science, 230:1160- 
5 63, 1985; Burk et al., J. Virol. 62:649-54, 1988; Chisari 
et al., Science 230:1157-60, 1985; Chisari, in Current 
Topics i n Microbiology and Immunology- p. 85-101, 1991). 
Candidate polypeptides of the invention can also be 
tested in animals that are naturally infected with HBV, 
10 e.g., in chimpanzees, by administering the polypeptides, 
or the nucleic acids encoding them, to the animal by one 
of the methods discussed above, or by other standard 
methods . 



Other Embodiments 
15 From the above description, one skilled in the art 

can easily ascertain the essential characteristics of the 
present invention, and without departing from the spirit 
and scope thereof, can make various changes and 
modifications of the invention to adapt it to various 
20 usages and conditions. 

All publications cited herein are fully 
incorporated by reference in their entirety. 

Other embodiments axe within the claims set forth 

below. 
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TABLE 1 



CONSERVATIVE AMINO ACID REPLACEMENTS 



For Amino Acid 


Code 


Replace With 


Alanine 


A 


D-Ala, Gly, Aib, £-Ala, Acp, L-Cys, D-Cys 


Arginine 


R 


D-Arg, Lys, D-Lys , homo-Arg, D-homo-Arg, 
Met, lie, D-Met, D-Xle, Ora, D-Orn 


Asparaglne 


N 


D— Aon, Asp, D-Aep, Glu, D-Glu, Gin, D— Gin 


Aspartic Acid 


D 


D-Asp, D-Asn, Aan, Glu, D-Glu, Gin, D-Gln 


Cysteine 


C 


D— Cys, s— Me— eye. Hex, D— Met , Tnr, D— Tnr H 


Glutamine 


Q 


D-Gln, A8n, D-Asn, Glu, D-Glu, Asp, D-Asp 


H GltltftffllC Acid 


B 


D— Glu, D— Asp, Asp, Asn, D— Asn, Gin, D— Gin 


I Glycine 


6 


Ala, D— Ala, Pro, D-Pro, Alb, 0-Ala, Acp 


1 Zeoleucine 


X 


D-Ile, Val, D-Val, AdaA, AdaG, Leu, D-Leu, 
Met, D-Met 


1 Leucine 


L 


D-Leu, Val, D-Val, AdaA, AdaG, Leu, D— Leu, 
Met , D— Met 


Lysine 


K 


D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, 
Met, D— Met, lie, D-Ile, Qrn, D-Om 


Methionine 


M 


D-Met, S-Me-Cys, lie, D-Ile, Leu, D— Leu, 
Val, D-Val 


Phenylalanine 


P 


D— Phe, Tyr, D-Thr, L-Dopa, Bis, D— His, 
Trp, D-Irp, Trans— 3 , 4 , or 5— pbeny lprol ine , 
AdaA, AdaG, els— 3,4, or 5-phenylprollne, 
Bpa, D— Bpa 


Proline 


P 


D— Pro, L-I-thioarolldlne— 4— carboxylic | 
acid, D-or L-l-oxa^lidine— 4-carboxylic U 
acid (Kauer, U.S- Patent (4,511,390) fl 


Serine 


S 


D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, |j 
Met(O), D-Met (O), L-Cys, D-Cye U 


I Threonine 


T 


D-Thr, Ser, D-Sax, alio— Thr, Met, D-Met, | 
Met CO), D-Met (O), Val, D-Val 1 


1 Tyrosine 


T 


D-Tyr, Phe, D-Phe, L-Dopa, Bis, D-His R 


9 Valine 


V 


D-Val, Leu, D-Leu, lie, D-Zle, Met, D-Met, | 
AdaA, AdaG |) 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANTS The General Hospital Corporation 
(ii> TITLE OP INVENTION: INHIBITION OF HEPATITIS B REPLICATION 
(ill) NUMBER OP SEQUENCES t 14 

(lv) CORRESPONDENCE ADDRESS* 

(A) ADDRESSEE z Fish 6 Richardson P.C. 

(B) STREETS 225 Franklin Street 

(C) CITY: Boston 

(D) STATES HA 

(E) COUNTRY: USA 
<F) ZIPS 02110-2804 

(v) COMPUTER READABLE FORMS 

(A) MEDIUM TYPES Floppy disk 

(B) COMPUTERS IBM PC compatible 

(C) OPERATING SYSTEMS PC-DOS/MS-DOS 

(D) SOFTWARES Patent In Release #1.0 r Version #1.30 

<vll) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/017,814 

(B) FILING DATES 20-JUN-1995 

(C) CLASSIFICATIONS 

(Viii) ATTORNEY/AGENT INFORMATION: 

(A) NAMEs Clark, Paul T. 

(B) REGISTRATION NUMBERS 30,162 

(C) REFERENCE /DOCKET NUMBERS 00786/282001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONES 617/542-5070 

(B) TELEFAX s 617/542-8906 

(C) TELEX: 200154 



{2} INFORMATION FOR SEQ ID NOslt 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 1041 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY s linear 

(ii> MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOsls 

ATG GAC ATA GAT CCC TAT AAA GAA TTT GGT TCA TCT TAT CAG TTG TTG 48 
Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ser Ser Tyr Gin Leu Leu 
15 10 15 

AAT TTT CTT CCT TTG GAC TTC TTT CCT GAC CTT AAT GCT TTG GTG GAC 96 
Asn Phe Leu Pro Leu Asp Phe Phe Pro Asp Leu Asn Ala Leu Val Asp 
20 25 30 
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ACT GCT ACT GCC TT6 TAT 6AA OAA GAG CTA ACA GGT AGO GAA CAT TOC 144 
Thr Aim Thr Ala Leu Tyr Glu Glu Glu Leu Thr Gly Arg Glu Bio Cye 
35 40 45 

TCT COG CAC CAT ACA GCT ATT AGA CAA GCT TTA GTA TGC TGG GAT GAA 192 
Ser Pro Bis Bis Thr Ala lie Arg Gin Ala Leu Val Cys Trp Asp Glu 
50 55 60 

TTA ACT AAA TTG ATA GCT TGG ATG AGC TCT AAC ATA ACT TCT GAA CAA 240 
Leu Thr Lye Leu lie Ala Trp Met: Ser Ser Aon lie Thr Ser Glu Gin 

65 70 75 BO 

GTA AGA ACA ATC ATA GTA AAT CAT GTC AAT GAT ACC TGG GGA CTT AAG 288 
Val Arg Thr lie He Val Asn Ble Val Asn Asp Thr Trp Gly Leu Lye 
85 90 95 

GTG AGA CAA ACT TTA TGG TTT CAT TTG TCA TCT CTC ACT TTC GGA CAA 336 
Val Arg Gin Ser Leu Trp Phe Ble Leu Ser Cye Leu Thr Phe Gly Gin 
lOO 105 110 

CAT ACA GTT CAA GAA TTT TTA GTA AGT TTT GTA GTA TGG ATC AGA ACT 384 
Bis Thr Val Gin Glu Phe Leu Val Ser Phe Val Val Trp> lie Arg Thr 
115 120 125 

CCA GCT CCA TAT AGA CCT OCT AAT GGA CCC ATT CTC TCG ACT CTT COG 432 
Pro Ala Pro Tyr Arg Pro Pro Aen Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

GAA CAT ACA GTC ATT AGA AGA GGA GGT GCA AGA GCT TCT AGG TCC CCC 480 
Glu Bis Thr Val He Arg Arg Gly Gly Ala Arg Ala Ser Arg Ser Pro 
145 150 155 160 

AGA AGA CGC ACT CCC TCT CCT CGC AGO AGA AGA TCC CAA AAT TCG GAG 528 
Arg Arg Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin Asn Ser Gin 
165 170 175 

TTC CAA ACT TGC AAA CAC TTG CCA ACC TCC TGT CCA CCA ACT TGC AAT 576 
Phe Gin Thr Cys Lye Ble Leu Pro Thr Ser Cys Pro Pro Thr Cye Asn 
180 185 190 

GGC TTT CGT TGG ATG TAT CTO COG CGT TTT ATC ATA TAC CTA TTA GTC 624 
Gly Phe Arg Trp Mat Tyr Leu Arg Arg Phe lie He Tyr Leu Leu Val 
195 200 205 

CTO CTG CTO TGC CTC ATC TTC TTG TTG GTT CTC CTG GAC TGG AAA GGT 672 
Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp Trp Lye Gly 
210 215 220 

TTA ATA CCT GTC TGT CCT CTT CAA CCC ACA ACA GAA ACA ACA GTC AAT 720 
Leu Xle Pro Val Cye Pro Leu Gin Pro Thr Thr Glu Thr Thr Val Aen 
225 230 235 240 

TGC AGA CAA TGC ACA ATC TCT GCA CAA GAC ATG TAT ACT CCT CCT TAC 768 
Cye Arg Gin Cys Thr lie Ser Ala Gin Aep Met Tyr Thr Pro Pro Tyr 
245 250 255 

TGT TGT TGT TTA AAA CCT ACG GCA GGA AAT TGC ACT TGT TGG CCC ATC 816 
Cye Cye Cys Leu Lye Pro Thr Ala Gly Aen Cys Thr Cys Trp Pro lie 
260 265 270 

CCT TCA TCA TGG GCT TTA GGA AAT TAC CTA TGG GAG TGG GCC TTA GCT 864 
Pro Ser Ser Trp Ala Leu Gly Asn Tyr Leu Trp Glu Trp Ala Leu Ala 
275 280 285 
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CGT CTC TCT TGG CTC AAT TTA CTA GTG CCC TTG CTT CAA TGG TTA GGA 912 
Arg Leu Ser Trp Leu Asn Leu Leu Val Pro Leu Leu Gin Trp Leu Gly 
290 295 300 

GGA ATT TCC CTC ATT GCG TGG TTT TTG CTT ATA TGG ATG ATT TGG TTT 960 
Gly lie Ser Leu lie Ala Trp Phe Leu Leu lie Trp Met lie Trp Phe 
305 310 315 320 

TGG GGG CCC GCA CTT CTG AGC ATC TTA COG CCA TTT ATT CCC ATA TTT 1008 
Ttd Glv Pro Ala Leu Leu ser He Leu Pro Pro Phe He Pro lie Phe 
325 330 335 

GTT CTG TTT TTC TTG ATT TGG GTA TAC ATT T GA 1041 
Val Leu Phe Phe Leu lie Trp Val Tyr lie 
340 345 

(2) INFORMATION FOR SEQ ID NO: 2: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTHt 346 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ser Ser Tyr Gin Leu Leu 
1 5 10 15 

Asn Phe Leu Pro Leu Asp Phe Phe Pro Asp Leu Asn Ala Leu Val Asp 
20 25 30 

Thr Ala Thr Ala Leu Tyr Glu Glu Glu Leu Thr Gly Arg Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala lie Arg Gin Ala Leu Val Cys Trp Asp Glu 
50 55 60 

Leu Thr Lys Leu He Ala Trp Met Ser Ser Asn He Thr Ser Glu Gin 
65 70 75 SO 

Val Arg Thr He He Val Asn His Val Asn Asp Thr Trp Gly Leu Lys 
85 90 95 

Val Arg Gin Ser Leu Trp Phe His Leu Ser Cys Leu Thr Phe Gly Gin 
100 105 HO 

Hie Thr Val Gin Glu Phe Leu Val Ser Phe Val Val Trp He Arg Thr 
115 120 125 

Pro Ala Pro Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 

Glu His Thr Val lie Arg Arg Gly Gly Ala Arg Ala Ser Arg Ser Pro 
145 150 155 160 

Arg Arg Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin Asn Ser Gin 
165 170 175 

Phe Gin Thr Cys Lys His Leu Pro Thr Ser Cys Pro Pro Thr Cys Asn 
180 185 190 
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Gly Phe Arg Trp Ket Tyr Leu Arg Arg Phe lie lie Tyr Leu Leu Val 
195 200 205 

Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp Trp Lye Cly 
210 215 220 

Leu lie Pro Val Cye Pro Leu Gin Pro Thr Thr Glu Thr Thr Val Asn 
225 230 235 240 

Cys Arg Gin Cyo Thr lie Ser Ala Gin Asp Met Tyr Thr Pro Pro Tyr 
245 250 255 

Cys Cys Cys Leu Lye Pro Thr Ala Gly Asn Cys Thr Cye Trp Pro He 
260 265 270 

Pro Ser Ser Trp Ala Leu Gly Asn Tyr Leu Trp Glu Trp Ala Leu Ala 
275 280 285 

Arg Leu Ser Trp Leu Asn Leu Leu Val Pro Leu Leu Gin Trp Leu Gly 
290 295 300 

Gly He Ser Leu lie Ala Trp Phe Leu Leu lie Trp Met lie Trp Phe 
305 310 315 320 

Trp Gly Pro Ala Leu Leu Ser lie Leu Pro Pro Phe He Pro Xle Phe 
325 330 335 

Val Leu Phe Phe Leu He Trp Val Tyr Zle 
340 345 

(2) INFORMATION FOR SEQ ZD NO: 3 1 

(1) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 1056 base pairs 
. (B) TYPE i nucleic acid 
(C) STRANDEDNBSS s single 
<D) TOPOLOGY* linear 

(11) MOLECULE TYPE J DNA 



<xl) SEQUENCE DESCRIPTION t SEQ ZD KO»3t 

OCT OGC AGO ATG GAC ATC GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG 48 
Pro Arg Arg Met Asp Zle Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val 
350 355 360 

GAG TTA CTC TOG TTT TTG CCT TCT GAC TTC TTT CCT TCA GTA CGA GAT 96 
Glu Leu Leu Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp 
365 370 375 

CTT CTA GAT ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT 144 
Leu Leu Asp Thr Ala ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro 
380 385 390 

GAG CAT TOT TCA CCT CAC CAT ACT CCA CTC AGG GAA GGA ATT CTT TGC 192 
Glu His Cys Ser Pro His Bis Thr Ala Leu Arg Gin Ala Zle Leu Cys 
395 400 405 410 

TGG GGG GAA CTA ATG ACT CTA GCT ACC TGG GTG GGT GTT AAT TTG GAA 240 
Trp Gly Glu Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu 
415 420 425 
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GAT CCA GCG TCT AGA GAC CTA GTA GTC AGT TAT GTC AAC ACT AAT ATG 288 
Asp Pro Ala Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Aan Met 
430 435 440 

GGC CTA AAG TTC AGG CAA CTC TTG TGG TTT CAC ATT TCT TGT CTC ACT 336 
Gly Leu Lys Phe Arg Gin Leu Leu Trp Phe Bis He Ser Cys Leu Thr 
445 450 455 

TTT GGA AGA GAA ACA GTT ATA GAG TAT TTG GTG TCT TTC GGA GTG TGG 384 
Phe Gly Arg Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp 
460 465 470 

ATT CGC ACT CCT CCA GCT TAT AGA CCA CCA AAT GCC CCT ATC CTA TCA 432 
lie Arg Thr Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser 
475 480 485 490 

AGA CTT CCG GAA CAT ACA GTC ATT AGA AGA GGA GGT GGA AGA GCT TCT 480 
Thr Leu Pro Glu His Thr Val lie Arg Arg Gly Gly Ala Arg Ala Ser 
495 500 505 

AGG TCC CCC AGA AGA CGC ACT CCC TCT CCT CGC AGG AGA AGA TCC CAA 528 
Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin 
510 515 520 

AAT TCG GAG TCC CCA ACC TCC AAT CAC TCA CCA ACC TCT TGT CCT CCA 576 
Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro 
525 530 535 

ACT TGT CCT GGT TAT CGC TGG ATG TGT CTG CGG CGT TTT ATC ATC TTC 624 
Thr Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
540 545 550 

CTC TTC ATC CTG CTG CTA TGC CTC ATC TTC TTG TTG GTT CTT CTG GAC 672 
Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp 
555 560 565 570 

TAT CAA GGT ATG TTG CCC GTT TGT CCT CTA ATT CCA GGA TCC TCA ACA 720 
Tyr Gin Gly Met Leu Pro Val Cys Pro Leu lie Pro Gly Ser Ser Thr 
575 580 585 

ACC AGC ACG GGA CCA TGC CGG ACC TGC ATG ACT ACT GCT CAA GGA ACC 768 
Thr Ser Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala Gin Gly Thr 
590 595 600 

TCT ATG TAT CCC TCC TGT TGC TGT ACC AAA CCT TCG GAC GGA AAT TGC 816 
Ser Met Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys 
605 610 615 

ACC TGT ATT CCC ATC OCA TCA TCC TGG GCT TTC GGA AAA TTC CTA TGG 864 
Thr Cys lie Pro lie Pro ser Ser Trp Ala Phe Gly Lys Phe Leu Trp 
620 625 630 

GAG TGG GCC TCA GCC CGT TTC TCC TGG CTC AGT TTA CTA GTG CCA TTT 912 
Glu Trp Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe 
635 640 645 650 

GTT CAS TGG TTC OTA GGG CTT TCC CCC ACT GTT TGG CTT TCA GTT ATA 960 
Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val lie 
655 660 665 

TGG ATG ATG TGG TAT TGG GGG CCA AGT CTG TAG AGC ATC TTG AGT CCC 1008 
Tro Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser lie Leu Ser Pro 
670 675 680 
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TTT TTA COG CTG TTA CCA ATT TTC TTT TGT CTT TGG GTA TAC ATT T 1054 
Phe Leu Pro Leu Leu Pro lie Phe Phe Cyo Leu Trp Val Tyr He 
685 690 695 

AA 1056 

(2) INFORMATION FOR SEQ ID NOl4l 

(1) SEQUENCE CHARACTERISTICS x 

(A) lengths 351 amino acids 

(B) TYPE* amino acid 
(D) TOPOLOGY* linear 

(11) MOLECULE TYPE i protein 

<xi) SEQUENCE DESCRIPTIONS SEQ ID NOt4t 

Pro Arg Arg Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val 

15 10 15 

Glu Leu Leu Ser Phe Leu Pro Ser Asp Phe Phe Pro ser Val Arg Asp 
20 25 30 

Leu Leu Asp Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro 
35 40 45 

Glu Bis Cys Ser Pro Bis Bis Thr Ala Leu Arg Gin Ala He Leu Cys 

50 55 60 

Trp Gly Glu Leu Met Thr Leu Ala Thr Trp Val Gly Val Aan Leu Glu 
65 70 7S SO 

Asp Pro Ala Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met 

85 90 95 * 

Gly Leu Lys Phe Arg Gin Leu Leu Trp Phe Bis lie Ser Cys Leu Thr 
lOO 105 110 

Phe Gly Arg Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp 
115 120 125 

He Arg Thr Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser 
130 135 140 

Thr Leu Pro Glu Bis Thr Val He Arg Arg Gly Gly Ala Arg Ala Ser 
145 150 155 160 

Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin 
165 170 175 

Asn Ser Gin Ser Pro Thr Ser Asn Bis Ser Pro Thr Ser Cys Pro Pro 
180 185 190 

Thr Cys Pro Gly Tyr Arg Trp Met Cya Leu Arg Arg Phe He He Phe 
195 20O 205 

Leu Phe He Leu Leu Leu Cys Leu He Phe Leu Leu Val Leu Leu Asp 
210 215 220 

Tyr Gin Gly Met Leu Pro Val Cys Pro Leu He Pro Gly Ser Ser Thr 
225 230 235 240 
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Thr Ser Thr Gly Pro Cye Arg Thr Cys Met Thr Thr Ala Gin Gly Thr 
245 250 255 

Ser Met Tyr Pro Ser Cye Cys Cys Thr Lye Pro Ser Asp Gly Aan Cye 
260 265 270 

Thr Cys He Pro He Pro Ser Ser Trp Ala Phe Gly Lys Phe Leu Trp 
275 280 285 

Glu Trp Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe 
290 295 300 

Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val He 

305 310 315 320 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser He Leu Ser Pro 
325 330 335 

Phe Leu Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val Tyr He 
340 345 350 

(2) INFORMATION FOR SEQ ID NOi5: 

(i) SEQUENCE CHARACTERISTICS S 

(A) LENGTH t 1194 base pairs 

(B) TYPEx nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DBA 



(xi) SEQUENCE DESCRIPTION! SEQ ID NOiSt 

ATG GAC ATC GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC 48 
Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
355 360 365 

TOG TTT TTG CCT TCT GAC TTC TTT CCT TCA GTA CGA GAT CTT CTA GAT 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
370 375 380 

ACC OCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGT 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
385 390 395 

TCA CCT CAC CAT ACT GCA CTC AGG CAA GGA ATT CTT TGC TGG GGG GAA 192 
Ser Pro His Bis Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
400 405 410 415 

CTA ATG ACT CTA GCT ACC TGG GTG GGT GTT AAT TTG GAA GAT CCA GCG 240 
Lsu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
420 425 430 

TCT AGA GAC CTA GTA GTC AGT TAT GTC AAC ACT AAT ATG GGC CTA AAG 288 
Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
435 440 445 

TTC AGG CAA CTC TTG TGG TTT CAC ATT TCT TGT CTC ACT TTT GGA AGA 336 
Phe Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
450 455 460 
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GAA ACA GTT ATA GAG TAT TTO GTO TCT TTC GGA GTG TGG ATT CGC ACT 384 
OXu Thr Val Xle Glu Tyr Leu Val Ser Phe Gly Val Trp Xle Arg Thr 
465 470 475 

CCT CCA GCT TAT AGA CCA CCA AAT GCC CCT ATC CTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
480 485 490 495 

GAG ACT ACT GTT GTT AGA CGA CCA GGC AGG TCC CCT AGA AGA AGA ACT 480 
Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
500. 505 510 

CCC TCG CCT CGC AGA CGA AGG TCT GAA TOG CCG CGT CGC AGA AGA TCT 528 
Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser . 
515 520 525 

GAA TCT CGG CTA GGA CCC CTT CTC GTG TTA GAG GCG GGG TTT TTC TTG 576 
Gin Ser Arg Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu 
530 535 540 

TTG ACA AGA ATC CTC ACA ATA CCG GAG AGT CTA GAC TCG TGG TGG ACT 624 
Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Aap Ser Trp Trp Thr 

545 550 555 

TCT CTC AAT TTT CTA GGG GGA ACT ACC GTG TGT CTT GGC GAA AAT TCG 672 
Ser Leu Aan Phe Leu Gly Gly Thr Thr Val Cys Leu Gly Gin Asn ser 
560 565 570 575 

GAG TCC CCA ACC TCC AAT GAC TCA CCA ACC TCT TGT CCT CCA ACT TGT 720 
Gin Ser Pro Thr Ser Aen Bis Ser Pro Thr Ser Cya Pro Pro Thr Cys 
580 585 590 

CCT GOT TAT CGC TOO ATG TGT CTG CGG CGT TTT ATC ATC TTC CTC TTC 768 
Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe Xle Xle Phe Leu Phe 
595 600 605 

ATC CTG CTG CTA TGC CTC ATC TTC TTG TTG GTT CTT CTG GAC TAT CAA 816 
Xle Leu Leu Leu Cya Leu Xle Phe Leu Leu Val Leu Leu Aap Tyr Gin 
610 615 620 

GOT ATG TTG CCC GTT TGT CCT CTA ATT CCA GGA TCC TCA ACA ACC AGC 864 
Gly Met Leu Pro Val Cys Pro Leu Xle Pro Gly Ser Ser Thr Thr Ser 
625 630 635 

AGG GGA CCA TGC CGG ACC TGC ATG ACT ACT GCT CAA GGA ACC TCT ATG 912 
Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala Gin Gly Thr Ser Met 
640 645 650 655 

TAT CCC TCC TGT TGC TGT ACC AAA CCT TCG GAC GGA AAT TGC ACC TGT 960 
Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys 
660 665 670 

ATT CCC ATC CCA TCA TCC TGG GCT TTC GGA AAA TTC CTA TGG GAG TGG 1008 
lie Pro Xle Pro Ser Ser Trp Ala Phe Gly Lys Phe Leu Trp Glu Trp 
675 680 685 

GCC TCA GCC CGT TTC TCC TGG CTC AGT TTA CTA GTG CCA TTT GTT GAG 1056 
Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe Val Gin 
690 695 700 

TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT TCA GTT ATA TGG ATG 1104 
Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val lie Trp Met 
705 710 715 
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ATG TGG TAT TGG GGG OCA ACT CTG TAC AGC ATC TTG AGT CCC TTT TTA 1152 
Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser He Leu Ser Pro Phe Leu 
720 725 730 735 

CCG CTG TTA CCA ATT TTC TTT TGT CTT TGG GTA TAC ATT T AA 1194 
Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr He 
740 745 

(2) INFORMATION FOR SEQ ID NO: 6 s 

<i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 397 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

fxi) SEQUENCE DESCRIPTIONS SEQ ID NO: 6: 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His lie Ser Cye Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu 
180 185 190 

Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr 
195 200 205 

Ser Leu Asn Phe Leu Gly Gly Thr Thr Val Cys Leu Gly Gin Asn Ser 
210 215 220 
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Gin Ser Pro Thr Ser Ann His Ser Pro Thr Sor Cys Pro Pro Thr Cys 
225 230 235 240 

Pro Gly Tyr Arg Trp Met cys Leu Arg Arg Phe He He Phe Leu Phe 
245 250 255 

lie Leu Leu Leu Cys Leu He Phe Leu Leu Val Leu Leu Asp Tyr Gin 
260 265 270 

Gly Met Leu Pro V&l cys Pro Leu lie Pro Gly Ser Ser Thr Thr Ser 
275 280 285 

Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala Gin Gly Thr ser Met 
290 295 30O 

Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asm Cys Thr Cys 
305 310 315 320 

Zle Pro He Pro Ser Ser Trp Ala Phe Gly Lys Phe Leu Trp Glu Trp 

325 330 335 

Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe Val Gin 
340 345 350 

Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser Val Xle Trp Met 

355 360 365 

Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser He Leu Ser Pro Phe Leu 
370 375 380 

Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val Tyr Zle 
385 390 395 

(2) INFORMATION FOR SBQ ZD NOt7t 

(1) SEQUENCE CHARACTERISTICS t 

(A) LENGTH x 870 base pairs 

(B) TTPEs nucleic acid 

(C) STRANDBDNESS t single 

(D) TOPOLOGY* linear 

(11) MOLECULE TYPES DMA 

(jcl) SEQUENCE DESCRIPTION t SEQ ZD NOl7l 

ATG GAC ATC GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC 48 
Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
400 405 410 

TCG TTT TTG CCT TCT GAC TTC TTT CCT TCA GTA CGA GAT CTT CTA GAT 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
415 420 425 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGT 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu Bis Cys 
430 435 440 445 

TCA CCT CAC CAT ACT GGA CTC AGG GAA GGA ATT CTT TGC TGG GGG GAA 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
450 455 460 
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CTA ATG ACT CTA GCT ACC TGG GTG GGT GTT AAT TTG GAA GAT CCA GCG 
Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
465 470 475 

TCT AGA GAC CTA GTA GTC ACT TAT GTC AAC ACT AAT ATG GGC CTA AAG 
Ser Arg Aap Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys 
480 485 490 

TTC AGG GAA CTC TTG TGG TTT CAC ATT TCT TGT CTC ACT TTT GGA AGA 
Phe Arg Gin Leu Leu Trp Phe HLb He Ser Cys Leu Thr Phe Gly Arg 
495 500 505 

GAA AGA GTT ATA GAG TAT TTG GTG TCT TTC GGA GTG TGG ATT CGC ACT 
Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
510 515 520 525 

OCT CCA GCT TAT AGA CCA CCA AAT GCC CCT ATC CTA TCA AGA CTT CCG 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 



GAG ACT ACT GTT GTT AGA CGA CGA GGC AGG TCC CCT AGA AGA AGA ACT 
Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
545 550 555 

CCC TOG CCT CGC AGA CGA AGG TCT GAA TOG CCG GGT CGC AGA AGA TCC 
Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Aro Ser 
560 565 570 

TCA AGA ACC AGG ACG GGA CCA TGG CGG ACC TGC ATG ACT ACT GCT GAA 
Ser Thr Thr Ser Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala Gin 
575 580 585 

GGA ACG TCT ATG TAT CCC TCC TGT TGG TGT ACG AAA CCT TGG GAG GGA 
Gly Thr Ser Met Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly 
590 595 600 605 

AAT TGC ACC TGT ATT CCC ATC CCA TCA TCC TGG GCT TTC GGA AAA TTC 
Asn Cys Thr Cys lie Pro He Pro Ser Ser Trp Ala Phe Gly Lye Phe 
610 615 620 

CTA TGG GAG TGG GCC TCA GCC CGT TTC TCC TGG CTC AGT TTA CTA GTG 
Leu Trp Glu Trp Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val 
625 630 635 

CCA TTT GTT GAG TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT TCA 
Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser 
640 645 650 

GTT ATA TGG ATG ATG TGG TAT TGG GGG CGA AGT CTG TAG AGG ATC TTG 
Val He Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser He Leu 
655 660 665 

AGT CCC TTT TTA CCG CTG TTA CCA ATT TTC TTT TGT CTT TGG GTA TAG 
Ser Pro Phe Leu Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val Tyr 
670 675 680 685 

ATT T AA 
He 



(2) INFORMATION FOR SBQ ID NO: 8: 
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(i) SBQUENCB CHARACTERISTICS l 

(A) LENGTHS 289 amino acids 

(B) TYFEs amino acid 
CO) TOPOLOGY s linear 

(11) XOLBCULK TYPBi protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Mat Asp Xle Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Aep Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu Bis Cys 
35 40 45 

Ser Pro Bis Bis Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met: Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lye 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe Bis lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val lie Glu Tyr Leu Val Ser Phe. Gly Val Trp Xle Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

Glu Thr Thr Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Ser Thr Thr Ser Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala Gin 
180 185 190 

Gly Thr Ser Met Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly 

195 200 205 

Asn Cys Thr Cys He Pro Xle Pro Ser Ser Trp Ala Phe Gly Lys Phe 
210 215 220 

Leu Trp Glu Trp Ala Ser Ala Arg Phe Ser Trp Leu Ser Leu Leu Val 
225 230 . 235 240 

Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser 
245 250 255 

Val Xle Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser lie Leu 
260 265 270 

Ser Pro Phe Leu Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val Tyr 
275 280 285 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH i 1263 base pairs 

(B) TYPE i nucleic acid 

(C) STRAND EDNESS t single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPES DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9s 

ATO GAT ATC AAT GOT TCT AGA GCC TTA GCC AAT GTG TAT GAT CTA OCA 48 
Mel: Aep lie Asn Ala Ser Arg Ala Leu Ala Asn Val Tyr Asp Leu Pro 
290 295 300 305 

GAT GAT TTC TTT CCA AAA ATA GAT GAT CTT GTT AGA GAT GCT AAA GAC 96 
Asp Asp Phe Phe Pro Lys lie Asp Asp Leu Val Arg Asp Ala Lys Asp 
310 315 320 

GCT TTA GAG CCT TAT TGG AAA TCA GAT TCA ATA AAG AAA CAT GTT TTG 144 
Ala Leu Glu Pro Tyr Trp Lys Ser Asp Ser lie Lys Lys His Val Leu 
325 330 335 

ATT GCA ACT CAC TTT GTG GAT CTT ATT GAA GAC TTC TGG GAG ACT AGA 192 
lie Ala Thr His Phe Val Asp Leu lie Glu Asp Phe Trp Gin Thr Thr 
340 345 350 

GAG GGC ATG CAT GAA ATA GCC GAA TCA TTA AGA GCT GTT ATA CCT CCC 240 
Gin Gly Met: His Glu He Ala Glu Ser Leu Arg Ala Val lie Pro Pro 
355 360 365 

ACT ACT ACT CCT GTT CCA COG GGT TAT CTT ATT GAG CAC GAA GAA GCT 288 
Thr Thr Thr Pro Val Pro Pro Gly Tyr Leu He Gin His Glu Glu Ala 
370 375 380 385 

GAA GAG ATA CCT TTG GGA GAT TTA TTT AAA CAC GAA GAA GAA AGG ATA 336 
Glu Glu lie Pro Leu Gly Asp Leu Phe Lys His Gin Glu Glu Arg lie 
390 395 400 

GTG AGT TTC CAA CCC GAC TAT CCG ATT ACG GCT AGA ATT CAT GCT CAT 384 
Val Ser Phe Gin Pro Asp Tyr Pro He Thr Ala Arg He His Ala His 
405 410 415 

TTG AAA GCT TAT GCA AAA ATT AAC GAG GAA TCA CTG GAT AGG GCT AGG 432 
Leu Lys Ala Tyr Ala Lys He Asn Glu Glu Ser Leu Asp Arg Ala Arg 
420 425 430 

AGA TTG CTT TGG TGG CAT TAC AAC TGT TTA CTG TGG GGA GAA GCT CAA 480 
Arg Leu Leu Trp Trp His Tyr Asn Cys Leu Leu Trp Gly Glu Ala Gin 
435 440 445 

GTT ACT AAC TAT ATT TCT CGC TTG CGT ACT TGG TTG TCA ACT CCT GAG 528 
Val Thr Aen Tyr He Ser Arg Leu Arg Thr Trp Leu Ser Thr Pro Glu 
450 455 460 465 
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AAA TAT A6A GGT AGA GAT GCC CCG AOC ATT GAA GCA ATC ACT AGA CCA 576 
Lys Tyr Arg Gly Arg Asp Ala Pro Thr He GXu Ala lie Thr Arg Pro 
470 475 480 

ATC CAO GTG OCT CAG GGA GGC CGA AAA ACA ACT ACG GGT ACT AGA AAA 624 
XI* Gin Val Ala Gin Gly Gly Arg Lys Thr Thr Thr Gly. Thr Arg Lye 
485 490 495 

OCT GGT GGA CTC GAA CCT AGA AGA AGA AAA GTT AAA ACC ACA GTT GTC 672 
Pro Arg Gly Leu Glu Pro Arg Arg Arg Lys Val Lys Thr Thr Val Val 
500 505 510 

TAT GGG AGA AGA COT TCA AAG TCC CGG GGA AGO AGA GCC CCT ACA CCC 720 
Tyr Gly Arg Arg Arg Ser Lys Ser Arg Gly Arg Arg" Ala Pro Thr Pro 
515 520 525 

GAA CGT GCG GGC TCC CCT CTC CCA CGT AGT TOG AGC AGC CAC CAT AGA 768 
Gin arg Ala Gly Ser Pro Leu Pro Arg Ser Ser Ser Ser Bis Hie Arg 
530 535 540 545 

TCC TTC GGG GGA ATA CTA GCT GGC CTA ATC GGA TTA CTG GTA AGC TTT 816 
Ser Phe Gly - Gly He Leu Ala Gly Leu Zle Gly Leu Leu Val Ser Phe 

550 555 560 

TTC TTG TTG ATA AAA ATT CTA GAA ATA CTG AGG AGG CTA GAT TGG TGG 864 
Phe Leu Leu Zle Lye Zle lieu Glu Zle Leu Arg Arg Leu Asp Trp Trp 
565 570 575 

TGG ATT TCT CTC AGT TCT CCA AAG GGA AAA ATC CAA TGC GCT TTC CAA 912 
Trp Zle Ser Leu Ser Ser Pro Lys Gly Lys Met Gin Cys Ala Phe Gin 
580 585 590 

GAT ACT GGA GCC CAA ATC TCT CCA CAT TAC GTC GGA TCT TGC CCG TGG 960 
Asp Thr Gly Ala Gin Zle Ser Pro Bis Tyr Val Gly Ser Cys Pro Trp 
595 600 605 

GGA TGC CCA GGA TTT CTT TGG ACC TAT CTC AGG CTT TTT ATC ATC TTC 1008 
Gly Cys Pro Gly Phe Leu Trp Thr Tyr Leu Arg Leu Phe Zle Zle Phe 
610 615 620 625 

CTC TTA ATC CTG CTA GTA GCA GCA GGC TTG CTG TAT CTG ACG GAC AAC 1056 
Leu Leu Zle Leu Leu Val Ala Ala Gly Leu Leu Tyr Leu Thr Asp Asn 
630 635 640 

GGG TCT ACT ATT TTA GGA AAG CTC CAA TGG GCG TCG GTC TCA GCC CTT 1104 
Gly Ser Thr Zle Leu Gly Lys Leu Gin Trp Ala Ser Val Ser Ala Leu 
645 650 655 

TTC TCC TCC ATC TCT TCA CTA CTG CCC TCG GAT CCG AAA TCT CTC GTC 1152 
Phe Ser Ser Zle Ser Ser Leu Leu Pro Ser Asp pro Lys Ser Leu Val 
660 665 670 

GCT TTA ACQ TTT GGA CTT TCA CTT ATA TGG ATG ACT TCC TCC TCT GCC 1200 
Ala Leu Thr Phe Gly Leu Ser Leu Zle Trp Met Thr Ser Ser Ser Ala 
675 680 685 

ACC CAA ACG CTC GTC ACC TTA ACG CAA TTA GCC ACG CTG TCT GCT CTT 1248 
Thr Gin Thr Leu Val Thr Leu Thr Gin Leu Ala Thr Leu Ser Ala Leu 
690 695 700 705 

TTT TAC AAG AGC T AG 1263 
Phe Tyr Lys Ser 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOxlO: 

Met Asp lie Asn Ala Ser Arg Ala Leu Ala Asn Val Tyr Asp Leu Pro 
15 10 15 

Asp Asp Pbe Phe Pro Lys lie Asp Asp Leu Val Arg Asp Ala Lys Asp 
20 25 30 

Ala Leu Glu Pro Tyr Trp Lys Ser Asp Ser lie Lys Lys Bis Val Leu 
35 40 45 

He Ala Thr Bis Phe Val Asp Leu lie Glu Asp Phe Trp Gin Thr Thr 
50 55 60 

Gin Gly Met Bis Glu He Ala Glu Ser Leu Arg Ala Val He Pro Pro 
65 70 75 80 

Thr Thr Thr Pro Val Pro Pro Gly Tyr Leu lie Gin His Glu Glu Ala 
85 90 95 

Glu Glu He Pro Leu Gly Asp Leu Phe Lys. His Gin Glu Glu Arg He 
100 105 110 

Val Ser Phe Gin Pro Aep Tyr Pro He Thr Ala Arg He His Ala His 
115 120 125 

Leu Lys Ala Tyr Ala Lys He Asn Glu Glu Ser Leu Asp Arg Ala Arg 
130 135 140 

Arg Leu Leu Trp Trp His Tyr Asn Cys Leu Leu Trp Gly Glu Ala Gin 
145 150 155 160 

Val Thr Asn Tyr He Ser Arg Leu Arg Thr Trp Leu Ser Thr Pro Glu 
165 170 175 

Lys Tyr Arg Gly Arg Asp Ala Pro Thr He Glu Ala He Thr Arg Pro 
180 185 190 

He Gin Val Ala Gin Gly Gly Arg Lys Thr Thr Thr Gly Thr Arg Lys 
195 200 205 

Pro Arg Gly Leu Glu Pro Arg Arg Arg Lys Val Lys Thr Thr Val Val 
210 215 220 

Tyr Gly Arg Arg Arg Ser Lys Ser Arg Gly Arg Arg Ala Pro Thr Pro 
225 230 235 240 

Gin Arg Ala Gly Ser Pro Leu Pro Arg Ser Ser Ser Ser Bis His Arg 
245 250 255 

Ser Phe Gly Gly He Leu Ala Gly Leu lie Gly Leu Leu Val Ser Phe 
260 265 270 
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Phe Leu Leu lie Lye He Leu Glu He Leu Arg Arg Leu Asp Trp Trp 
275 280 285 

Trp He Ser Leu Ser Ser Pro Lys Gly Lye Met: Gin Cys Ala Phe Gin 
290 295 300 

Asp Thr Gly Ala Gin He Ser Pro Bis Tyr Val Gly Ser Cys Pro Trp 
305 310 315 320 

Gly eye Pro Gly Phe Leu Trp Thr Tyr Leu Arg Leu Phe He He Phe 
325 330 335 

Leu Leu He Leu Leu Val Ala Ala Gly Leu Leu Tyr Leu Thr Asp Asn 

340 345 350 

Gly Ser Thr He Leu Gly Lys Leu Gin Trp Ala Ser Val Ser Ala Leu 
355 360 365 

Phe Ser Ser He Ser Ser Leu Leu Pro Ser Asp Pro Lys Ser Leu Val 
370 375 380 

Ala Leu Thr Phe Gly Leu Ser Leu He Trp Met Thr Ser Ser Ser Ala 
385 390 395 400 

Thr Gin Thr Leu Val Thr Leu Thr Gin Leu Ala Thr Leu Ser Ala Leu 
405 410 415 

Phe Tyr Lys Ser 
420 

(2) INFORMATION FOR SBQ ID NO tilt 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 552 base pairs 
<B) TYPE x nucleic acid 

(C) STRANDEDKBSS J single 

(D) TOPOLOGY: linear 

{11) MOLECULE TYPES DMA 



(Xl) SEQUENCE DESCRIPTION: SBQ ID NOslli 

ASG GAC ATC GAC CCT TAT AAA GAA TTT GGA GCT ACT GTC GAG TTA CTC 48 
Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

TOG TTT TTG CCT TCT GAC TTC TTT CCT TCA GTA GGA GAT CTT CTA GAT 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TCT 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu Bis Cys 
35 40 45 

TCA CCT GAC CAT ACT GGA CTC AGG GAA GGA ATT CTT TGC TGG GGG GAA 192 
8er Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
SO 55 60 

CTA ATG ACT CTA GCT ACC TGG GTG GGT GTT AAT TTG GAA GAT CCA GCG 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 60 
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TCT AGA GAC CTA GTA 6TC AGT TAT OTC AAC ACT AAT ATG GGC CTA AAG 288 
Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met; Gly Leu Lye 
85 90 95 

TTC AGO CAA CTC TTG TGG TTT CAC ATT TCT TGT CTC ACT TTT CGA ACA 336 
Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Thr 
100 105 110 

GAA ACA GTT ATA GAG TAT TTG GTG TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val He Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

CCT CCA GCT TAT AGA CCA CCA AAT GCC CCT ATC CTA TCA ACA CTT COG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu ser Thr Leu Pro 
130 135 140 

GAG ACT ACT GTT GTT AGA CGA CCA GGC AGG TCC CCT AGA AGA AGA ACT 480 
Glu Thr Thr Val Val Arg Arg Pro Gly Arg ser Pro Arg Arg Arg Thr 
145 150 155 160 

COC TCG OCT CGC AGA CGA AGG TCT CAA TCG CCC CGT CGC AGA AGA TCT 528 
Pro ser Pro Arg Arg Arg Arg Ser Gin ser Pro Arg Arg Arg Arg Ser 
165 170 175 

CAA TCT CGG GAA TCT CAA TGT TAG 552 

Gin Ser Arg Glu Ser Gin Cys 
180 

<2) INFORMATION FOR SEQ ID NO: 12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY t linear 

(11) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 1 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu Hie Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Ser Tyr Val Asn Thr Asn Met: Gly Leu Lys 
85 90 95 

Phe Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Thr 
100 105 110 

Glu Thr Val lie Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 
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Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro II* Lou Ser Thr Leu Pro 

130 135 140 

Glu Thr Thr Val Val Arg Arg Pro Gly Arg Ser Pro Arg Arg Arg Thr 
145 150 155 160 

Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser 
165 170 175 

Gin Ser Arg Olu Ser Gin Cys 

180 

(2) INFORMATION FOR SBQ ID NOil3t 

(1) SEQUENCE CHARACTERISTICS r 

(A) LENGTHS 681 base pairs 

(B) TYPE i nucleic acid 
<C) STRANDEDNBSSs single 
(D) TOPOLOGY t linear 

(li) HOLE COLE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION s SEQ ID NO*13» 

ATG GAG AAC ATC ACA TCA GGA TTC CTA GGA CCC CTT CTC GTG TTA GAG 48 
Met Gin Asn lie Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
1 5 10 15 

GCG GGG TTT TTC TTG TTG ACA AGA ATC CTC ACA ATA CCC GA0 ACT CTA ' 96 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
20 25 30 

GAC TOG TGG TOG ACT TCT CTC AAT TTT CTA GGG GGA ACT ACC GTG TGT 144 
Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Thr Thr Val Cys 
35 40 45 

CTT GGC CAA AAT TOG CAG TCC CCA ACC TCC AAT CAC TCA CCA ACC TCT 192 
Leu Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn Bis Ser Pro Thr Ser 
50 55 60 

TGT OCT CCA ACT TGT CCT GGT TAT C6C TGG ATG TGT CTG CCC CGT TTT 240 
cys Pro Pro Thr Cys Pro Gly Tyr Arg Trp Net Cys Leu Arg Arg Phe 
65 70 75 80 

ATC ATC TTC CTC TTC ATC CTG CTG CTA TGC CTC ATC TTC TTG TTG GTT 288 
lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val 
85 90 95 

CTT CTG GAC TAT CAA GGT ATG TTG CCC GTT TGT CCT CTA ATT CCA GGA 336 
Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu lie Pro Gly 
100 105 110 

TCC TCA ACA ACC AGC ACG GGA CCA TGC CGG ACC TGC ATG ACT ACT OCT 384 
Ser Ser Thr Thr Ser Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala 
115 120 125 

CAA GGA ACC TCT ATG TAT CCC TCC TGT TGC TGT ACC AAA CCT TGG GAC 432 
Gin Gly Thr Ser Met Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
130 135 140 

GGA AAT TGC ACC TGT ATT CCC ATC CCA TCA TCC TGG GCT TTC GGA AAA 480 
Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Gly Lys 
145 150 155 160 
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TTC CTA TGG GAG TGG GCC TCA GCC CCT TTC TCC TGG CTC ACT TTA CTA 528 

Phe Leu Trp Glu Trp Ala Ser Ala Pro Phe Ser Trp Leu ser Leu Leu 
165 170 175 

GTC CCA TTT GTT CAG TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT 576 
Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu 
180 185 190 

TCA GTT ATA TGG ATG ATG TGG TAT TGG GGG CCA AGT CTG TAC AGC ATC 624 
Ser Val lie Trp Met: Met: Trp Tyr Trp Gly Pro Ser Leu Tyr Ser lie 
195 200 205 

TTG AGT CCC TTT TTA CCG CTG TTA CCA ATT TTC TTT TGT CTT TGG GTA 672 
Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val 
210 215 220 

TAC ATT TAA 681 

Tyr lie 

225 

(2) INFORMATION FOR SEQ ID NO: 14: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TOPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Glu Asn lie Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
15 10 15 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
20 25 30 

Aep Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Thr Thr Val Cye 
35 40 45 

Leu Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser 
50 55 60 

Cys Pro Pro Thr Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
65 70 75 80 

lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val 
85 90 95 

Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu lie Pro Gly 
100 105 110 

Ser Ser Thr Thr Ser Thr Gly Pro Cys Arg Thr Cys Met Thr Thr Ala 
115 120 125 

Gin Gly Thr Ser Met Tyr Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
130 135 140 

Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Gly Lys 
145 150 155 160 
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Phe Leu Trp Glu Trp Ala Ser Ala Pro Phe Ser Trp Leu Sex Leu Leu 
165 170 175 

Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu 
160 185 190 

ser Val He Trp Het Hot Trp Tyr Trp Oly Pro Ser Leu Tyr ser lie 
195 200 205 

Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val 
210 215 220 

Tyr He 



225 
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What is claimed is: 

CIATMS 

1. A method of inhibiting the replication of a 
naturally-occurring hepadnavirus, said method comprising 
the steps of: 

5 (a) introducing into the proximity of said 

hepadnavirus a nucleic acid that encodes a hepadnavirus 
mutant polypeptide, wherein said polypeptide: 

(i) comprises a first amino acid sequence 
that is substantially identical to a region of a wild 

10 type hepadnavirus core protein of at least 70 amino acids 
in length, and 

(ii) lacks a second amino acid sequence of 
said wild type hepadnavirus core protein, wherein said 
second sequence comprises the car boxyt ermina 1 three amino 

15 acids of said wild type hepadnavirus core protein and 
does not exceed 100 amino acids in length; and 

(b) allowing said mutant polypeptide to be 
expressed from said nucleic acid, wherein said mutant 
polypeptide inhibits the replication of said naturally- 

20 occurring hepadnavirus. 

2. The method of claim 1, wherein said 
polypeptide further comprises a third amino acid sequence 
that is substantially identical to a portion of a wild 
type hepadnavirus surface protein* 

25 3. The method of claim 1 or 2, wherein said 

hepadnavirus is hepatitis B virus (HBV) . 

4* The method of claim 1 or 2, wherein said 
method is used to treat an infection of hepatitis B virus 
in a patient. 
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5. The method of claim 3, wherein the 

car boxyterm i na 1 amino acid of said first: amino acid 
sequence corresponds to a position selected from the 
group consisting positions 81 to 180 of SEQ ID NO: 12, 
5 inclusive. 

6. The method of claim 3, wherein said 
carboxyterminal amino acid of said first amino acid 
sequence corresponds to a position selected from the 
group consisting of positions 171 to 180 of SEQ ID NO: 

10 12, inclusive. 

7. The method of claim 3, wherein said 
carboxyterminal amino acid of said first amino acid 
sequence corresponds to position 171 of SEQ ID NO: 12. 

8. The method of claim 3 , wherein said 

15 carboxyterminal amino acid of said first amino acid 

sequence corresponds to position 178 of SEQ ID NO: 12. 

9. The method of claim 3, wherein said second 
amino acid sequence comprises amino acids 172*183 of SEQ 
ID NO: 12. 

20 10. The method of claim 3, wherein the 

aminoterminal amino acid of said third amino acid 
sequence corresponds to a position selected from the 
group consisting of positions 1 to 112 of SEQ ID NO: 14, 
inclusive* 

25 11. The method of claim 3, wherein the 

aminoterminal amino acid of said third amino acid 
sequence corresponds to a position selected from the 
group consisting of positions 1 to 8 of SEQ ID NO: 14, 
inclusive. 
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12. The method of claim 3, wherein said 
aminoterminal amino acid of said third amino acid 
sequence corresponds to position 5 of SEQ ID NO: 14. 

13. The method of claim 3, wherein said 

5 aminoterminal amino acid of said third amino acid 
sequence corresponds to position 8 of SEQ ID NO: 14. 



carboxyterminal amino acid of said third amino acid 
sequence corresponds to a position selected from the 
10 group consisting of positions 51 to 224 of SEQ XD NO: 14, 
inclusive. 



carboxyterminal amino acid of said third amino acid 
sequence corresponds to a position selected from the 
15 group consisting of positions 112 to 224 of SEQ ID NO: 
14 , inclusive • 

16. The method of claim 3, wherein the 
carboxyterminal amino acid of said third amino acid 
sequence corresponds to position 51 of SEQ ID NO: 14* 

20 17. The method of claim 3, wherein the 

carboxyterminal amino acid of said third amino acid 
sequence corresponds to position 112 of SEQ ID NO: 14. 

18. The method of claim 3, wherein the 
carboxyterminal amino acid of said third amino acid 
25 sequence corresponds to position 224 of SEQ ID NO: 14. 



14. The method of claim 3, wherein the 



15. The method of claim 3, wherein the 
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19. A method of inhibiting the replication of a 
naturally-occurring hepadnavirus , said method comprising 

introducing into the proximity of said 
hepadnavirus a hepadnavirus mutant polypeptide, wherein 
5 said polypeptide: 

(i) comprises a first amino acid sequence 
that is substantially identical to a region of a wild 
type hepadnavirus core protein of at least 70 amino acids 
in length, and 

10 (11) lacks a second amino acid sequence of 

said wild type hepadnavirus core protein, wherein said 
second sequence comprises the carboxyterminal three amino 
acids of said wild type hepadnavirus core protein and 
does not exceed 100 amino acids in length wherein said 

15 mutant polypeptide inhibits the replication of said 
hepadnavirus . 

20. The method of claim 19, wherein said 
polypeptide further comprises a third amino acid sequence 
that is substantially identical to a portion of & wild 

20 type hepadnavirus surface protein* 
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21. A method of inhibiting the replication of a 
naturally-occurring hepadnavirus, said method comprising 
the steps of: 

(a) introducing into the proximity of said 

5 hepadnavirus a nucleic acid that encodes a hepadnavirus 
mutant polypeptide, wherein said polypeptide comprises: 

(i) a first amino acid sequence that is 
substantially identical to a region of a wild type 
hepadnavirus core protein of at least 70 amino acids, and 
10 (ii) a second amino acid sequence that is 

substantially identical to a portion of a wild type 
hepadnavirus stir face protein. 

(b) allowing said mutant polypeptide to be 
expressed from said nucleic acid, wherein said mutant 

15 polypeptide inhibits the replication of said 
hepadnavirus . 

22. A nucleic acid encoding a mutant hepatitis B 
virus (HBV) polypeptide, wherein said polypeptide: 

20 (a) comprises a first amino acid sequence that is 

substantially identical to a region of a wild type HBV 
core protein of at least 70 amino acids in length; and 
(b) lacks a second amino acid sequence of said 
wild type HBV core protein, wherein said second sequence 

25 comprises the carboxyterminal three amino acids of said 

wild type HBV core protein and does not exceed nine amino 
acids in length. 

23. The nucleic acid of claim 22, wherein the 
carboxyterminal amino acid of said first amino acid 

30 sequence is selected from the group consisting of the 

amino acids between position 174 and position 180 of SEQ 
ID NO: 12, inclusive. 
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24. A nucleic acid encoding a mutant hepadnavirus 
polypeptide, wherein said polypeptide: 

(a) comprises a first amino acid sequence that is 
substantially identical to a region of a wild type 

5 hepadnavirus core protein of at least 70 amino acids in 
length; 

(b) lacks a second amino acid sequence of said 
wild type hepadnavirus core protein, wherein said second 
sequence comprises the carboxyterminal three amino acids 

10 of said wild type hepadnavirus core protein; and 

(c) comprises a third amino acid sequence that is 
substantially identical to a portion of a wild type 
hepadnavirus surface protein. 

25* The nucleic acid of claim 24, wherein said 
15 second amino acid sequence does not exceed 100 amino 
acids in length. 

26. The nucleic acid of claim 24, wherein the 
carboxyterminal amino acid of said first amino acid 
sequence corresponds to a position selected from the 

20 group consisting of positions 71 to 180 of SEQ ZD NO: 12, 
inclusive* 

27. A nucleic acid encoding a mutant hepadnavirus 
polypeptide, wherein said polypeptide comprises: 

(a) a first amino acid sequence that is 

25 substantially identical to a region of a wild type 

hepadnavirus core protein of at least 70 amino acids in 
length; and 

(b) a second amino acid sequence that is 
substantially identical to a portion of a wild type 

30 hepadnavirus surface protein. 



17:39:34 



• 




WO 97/00698 



PCT/US96/10602 



- 66 - 



28. A polypeptide encoded by the nucleic acid of 



any one of claims 22, 24, or 27. 

29. A vector comprising the nucleic acid of any 
one of claims 22, 24, or 27. 



mutant polypeptide of claim 28 in a pharmaceutically 
acceptable carrier. 

31. A therapeutic composition comprising the 
vector of claim 29 in a pharmaceutically acceptable 
10 carrier. 



hepadnavirus is selected from the group consisting of a 
voodchuck hepatitis virus (WHV) , a hepatitis B virus 
(HBV) , a hepatitis delta virus (HDV) , a ground squirrel 
15 hepatitis B virus, and a duck hepatitis B virus (DHBV) . 



5 



30. A therapeutic composition comprising the 



32. The method of claim 1, wherein said 
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541 / 181 571 / 191 

CAC CAC GCC ACC ATG CCC CAC CTC CAT CAC TAC TCC TCA ACC AAC CTC TAT CTA TCC CTC 
gin his gly chr nee pro asp leu his asp cyr cys ser arg asn leu cyr val ser leu 
(01 / 201 631 / . 211 

CTC TTC CTC TAC CAA ACC TTC CGA CCC AAA TTC CAC CTG TAT TCC CAT CCC ATC ATC CTC 
leu leu leu cyr gin chr phe gly arg lys leu his leu cyr ser his pro lie ile leu 
m i_ 221 €91 / 231 

CCC TTT 'CCC "AAA ATT-CCT^crC CCA CTG CCC CTC ACC CCC TTT CTC CTC CCT CAC TTT ACT 
gly phe arg lys ile pro met gly val gly leu ser pro phe leu leu ala gla phe Chr 
721 / 241 751 / 251 

ACT CCC ATT TCT TCA CTG CTT CCT ACC CCT TTC CCC CAC TCT TTC CCT TTC ACT TAT ATC 
ser ala ile cys ser val val arg arg ala phe pro his cys leu ala phe ser cyr nec 
781 / 281 811 / 271 

CAT GAT CTC GTA TTC CCC CCC AAG TCT CTA CAC CAT CTT CAC TCC CTT TTT ACC CCT CTT 
asp asp val val leu gly ala lys ser val gin his leu glu ser leu phe chr ala val 
841 / 281 871 / 291 

ACC AAT TTT CTT TTC TCT TTC CCT ATA CAT TTA 
chr asa phe leu leu ser leu gly ile his leu 
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i/i 31/11 

ATC CAT ATC AAT'CCT TCT ACA CCC TTA CCC AAT CTC TAT CAT CTA CCA CAT CAT TTC TTT 
mec asp ile asn ala ser arg ala leu ala asn val cyr asp lau pro asp asp phe phe 

61 / 2' 91/31 

CCA AAA ATA CAT CAT CTT CTT ACA CAT CCT AAA CAC CCT TTA CAC CCT TAT TCC AAA TCA 
pro lys ila asp asp leu val arg asp ala lys asp ala leu glu pro cyr trp lys ser 
121 / 41 "I / 51 

CAT TCA ATA AAC AAA CAT CTT TTC ATT CCA ACT CAC TTT CTC CAT CTT ATT CAA CAC TTC 
asp se- ila lys lys *i« val lau ila ala chr his pha val asp leu ila glu asp phe 
1B1 / 61 "1 ' 11 

TCC CAC ACT ACA CAC CCC ATC CAT CAA ATA CCC CAA TCA TTA ACA CCT CTT ATA CCT CCC 
c-p gin chr chr gift gly nec his giu ile ala glu ser lau arg ala val ila pro pro 
241 / Bl *71 ' 9i 

ACT ACT ACT CCT CTT CCA CCC CCT TAT CTT ATT CAC CAC CAA CAA CCT CAA CAC ATA CCT 
rtxr chr -pro val pro pro gly cyr lau ila gin his glu giu ala glu glu ila pro 
301 / 101 331 ' "I 

TTC CCA CAT TTA TTT AAA CAC CAA CAA CAA ACC ATA CTC ACT TTC CAA CCC CAC TAT CCC 
leu gly asp leu phe lys his gin glu giu arg ile val ser phe gin pro asp cyr pro 
361 / 12' 391 / "I 

ATT ACC CCT ACA ATT CAT CCT CAT TTC AAA CCT TAT CCA AAA ATT AAC CAC CAA TCA CTC 
ile chr ala arg ile his ala his leu lys ala cyr ala lys ile asn glu glu ser leu 
421/141 451 / 131 

CAT ACC CCT ACC ACA TTC CTT TCC TCC CAT TAC AAC TCT TTA CTC TCC CCA CAA CCT CAA 
asp arg ala arg arg leu leu crp crp his cyr asn cys leu lau crp gly glu ala gla 
481 / 161 311 ' 17r 

CTT ACT AAC TAT ATT TCT CCC TTC CCT ACT TCC TTC TCA ACT CCT CAC AAA TAT ACA CCT 
val car asn cyr ila ser arg leu arg chr crp leu ear tar pro glu lys cyr arg gly 
341 / 1B1 "1 ' 191 

ACA CAT CCC CCC ACC ATT CAA CCA ATC ACT ACA CCA ATC CAC CTC CCT CAC CCA CCC CCA 
arg asp ala pro chr ile glu ala ila chr arg pro ile gin val ala gin gly gly arg 
601 / 201 «1 / 211 

AAA ACA ACT ACC CCT ACT ACA AAA CCT CCT CCA CTC CAA CCT ACA ACA ACA AAA CTT AAA 
lys chr chr chr gly chr arg lys pro arg gly leu glu pro arg arg arg lys val lys 
661 / 221 «91 / 231 

^CC ACA CTT CTC-TAT-TSCC *GA JuCA CCT TCA AAC TCCCCCCCAAGGACACCCCCTACACCC 
chr chr val val cyr gly arg arg arg ser lys ser arg gly arg arg ala pro chr pro 
721 / 241 731 / 231 

CAA CCT CCC CCC TCC CCT CTC CCA CCT ACT TCC ACC ACC CAC CAT ACA TCC TTC CCC CCA 
gin arg ala gly ser pro lau pro arg ser ser ser ser his his arg ser phe gly gly 
781 / 261 Bll / 271 

ATA CTA CCT CCC CTA ATC CCA TTA CTC CTA ACC TTT TTC TTC TTC ATA AAA ATT CTA CAA 
ile leu ala gly leu ile gly leu leu val ser pha phe leu lau ile lys xle leu glu 
841 / 281 871 / 291 

ATA CTC ACC ACC CTA CAT TCC TCC TCC ATT TCT CTC ACT TCT CCA AAC CCA AAA ATC CAA 
ila leu arg arg lau asp crp crp crp ile ser leu sar ser pro lys gly lys mec gin 
901 / 301 931 / 311 

TCC CCT TTC CAA CAT ACT CCA CCC CAA ATC TCT CCA CAT TAC CTC CCA TCT TCC CCC TCC 
cys ala phe gin asp chr gly ala gla ila ser pro his cyr val gly ser cys pro crp 
961 / 321 99X / 331 

CCA TCC CCA CCA -TTT CTT TCC ACC TAT CTC ACC CTT TTT ATC ATC TTC CTC TTA ATC CTC 
gly cys pro gly phe leu crp chr cyr leu arg leu phe ile ile phe leu lau lie leu 
1021 / 341 ♦ 1031 / 331 

CTACTAGCACCACCCTTCCTCTATCTCACGGACAACW 

leu val ala ala gly leu lau cyr lau chr asp asn gly ear chr ile leu gly lys ieu 
1081 / 361 1111 / 371 

CAATCCGCCTCCCTCTCAGCCCTTTTCTCCTCCATCTCTT^ 

gin crp ala ser val ser ala leu phe sar sar lie sar aar leu leu pro ser asp pro 
1141 / 381 H71 / 331 

AAA TCT CTC CTC CCT TTA ACC TTT CCA CTT TCA CTT ATA TCC ATC ACT TCC TCC TCT CCC 
lys aar lau val ala lau chr phe gly leu sar leu ile crp mec chr ser aar sar ala 
1201 / 401 1231 / 411 

ACCCAAACCCTCCTCACCTTAACCCAATTACCCACCCTCTCTC^ 

chr gin chr leu val chr leu chr gin lau ala chr leu ser ala leu pha cyr lys sar 

1261 / 421 

TAC 
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